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NOVEL PLEXINS AND USES THEREOF 
BACKGROUND OF THE INVENTION 

Field of the Invention 

5 The invention relates to the identification and characterization of four novel 

proteins that are members of the plexin family. 
Summary of the Related Art 

Plexin family members encode large transmembrane proteins, whose cysteine- 
rich extracellular domains share regions of homology with the Scatter Factor receptors 

10 (encoded by the Met gene family). The extracellular domains of plexins also contain 
-500 amino acid Semaphorin domains (see below). The highly conserved cytoplasmic 
moieties of plexins (approx. 600 amino acids), however, have no homology with the 
Met tyrosine kinase domain, nor with any other known protein. Met-like receptors and 
their ligands, the Scatter Factors, mediate a complex biological program including 

15 dissociation of cell-cell contacts, motility and invasion (for a review see Tamagnone, L. 
and Comoglio, P.M. (1997) "Control of invasive growth by hepatocyte growth factor 
(HGF) and related scatter factors." Cytokine Growth Factor Rev 8, 129-142). During 
embryogenesis Scatter Factor- 1 and Met promote the dissociation of cell layers in the 
somites and drive the migration of myogenic cells to their appropriate location (Bladt, 

20 F., Riethmacher, D., Isenmann, S., Aguzzi, A., and Birchmeier, C. (1995) "Essential 
role for the c-met receptor in the migration of myogenic precursor cells into the limb 
bud." Nature 376, 768-771; Maina, F., Casagranda, F., Audero, E., Simeone, A., 
Comoglio, P.M., Klein, R.a., and Ponzetto, C. (1996) "Uncoupling of grb2 from the 
met receptor in vivo reveals complex roles in muscle development." Cell 87, 531-542). 

25 Met and Scatter Factor- 1 are also involved in controlling neurite outgrowth and axonal 
guidance (Ebens, A., Brose, K., Leonardo, E.D., Hanson, M.G.J., Bladt, F., Birchmeier, 
C, Barres, B.A., and Tessier-Lavigne, M. (1996) "Hepatocyte growth factor/scatter 
factor is an axonal chemoattractant and a neurotrophic factor for spinal motor neurons." 
Neuron 17, 1157-1172; Maina, F., Hilton, M.C., Ponzetto, C, Davies, A.M., and 

30 Klein, R. (1997) "Met receptor signaling is required for sensory nerve development and 
HGF promotes axonal growth and survival of sensory neurons." Genes Dev 11, 3341- 
3350; Maina, F., Hilton, M.G., Andres, R., Wyatt, S., Klein, R., and Davies, A.M. 
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(1998) "Multiple roles for hepatocyte growth factor in sympathetic neuron 
development." Neuron 20, 835-846). 

The first clue regarding a possible function for plexins came from the finding 
that a novel plexin, Vespr, interacts with the viral semaphorin A39R (Comeau, M.R., 
Johnson, R., DuBose, R.F., Petersen, M., Gearing, P., VandenBos, T., Park, L., Farrah, 
T., Buller, R.M., Cohen, J.I., Strockbine, L.D., Rauch, C., and Spriggs, M.K. (1998) "A 
poxvirus-encoded semaphorin induces cytokine production from monocytes and binds 
to a novel cellular semaphorin receptor, VESPR." Immunity. 8, 473-482). Semaphorins 
are a large family of secreted and membrane-bound molecules that are characterized by 
an extracellular domain containing a -500 amino acid Semaphorin domain (Kolodkin et 
al. (1993) "The semaphorin genes encode a family of transmembrane and secreted 
growth cone guidance molecules." Cell 75, 1389-1399). As noted above, plexins 
contain a more divergent but nevertheless conserved Semaphorin domain. 

Semaphorins were originally characterized in the nervous system, where they 
have been implicated in repulsive axon guidance (Kolodkin et al. (1993) supra', Luo, 
Y., Ruble, P., and Raper, J.A. (1993) "Collapsin: a protein in brain that ind uce s the 
collapse and paralysis of neuronal growth cones." Cell 75, 217-227; Tessier-Lavigne, 
M. and Goodman, C.S. (1996) "The molecular biology of axon guidance." Science 
274, 1123-1133). More recently, semaphorins have been furthermore implicated in 
cardiac and skeletal development (Behar, O., Golden, J.A., Mashimo, H., Schoen, F.J., 
and Fishman, M.C. (1996) "Semaphorin HI is needed for normal patterning and growth 
of nerves, bones and heart." Nature 383, 525-528), in the immune response (Hall, K.T., 
Boumsell, L., Schultze, J.L., Boussiotis, V.A., Dorfman, D.M., Cardoso, A.A., 
Bensussan, A., Nadler, L.M., and Freeman, G.J. (1996) "Human CD100, a novel 
leukocyte semaphorin that promotes B-cell aggregation and differentiation." 
Proc.Natl.Acad.Sci.U.S.A. 93, 11780-11785), in the regulation of angiogenesis (Miao, 
H.Q., Soker, S., Feiner, L., Alonso, J.L., Raper, J.A., and Klagsbrun, M. (1999) 
"Neuropilin-1 mediates collapsin- 1/Semaphorin HI inhibition of endothelial cell 
motility. Functional competition of collapsin- 1 and vascular endothelial growth factor- 
165" [In Process Citation]. J Cell Biol 146, 233-242), and in tumor growth and 
. metastasis (Christensen, C.R., Klingelhofer, J., Tarabykina, S., Hulgaard, E.F., 
... Kramerov, D., and Lukanidin,JEL (1998) "Transcription of a novel- mouse semaphorin 
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gene, M-semaH, correlates with the metastatic ability of mouse tumor cell lines." 
Cancer Res. 55, 1238-1244). 

Previously identified plexins have been shown to be expressed in the developing 
nervous system, (i.e. Plexin-A is a receptor for class 1 semaphores (Sema-la and 
Sema-lb). Moreover, Plexin-A has been shown via genetic analysis to control motor 
and CNS axon guidance induced by semaphores (Winberg, MX., Noordermeer, J.N., 
Tamagnone, L., Comoglio, P.M., Spriggs, M.K., Tessier-Lavigne, M., and Goodman, 
C.S. (1998). Plexin A is a neuronal semaphorin receptor that controls axon guidance. 
Cell 95, 903-916). 

Thus a need exists for discovery of other members of the plexin family of 
proteins. 

SUMMARY P i? THF. INVENTION 

The present invention provides four novel plexin family members. 
In one aspect, the invention provides an isolated nucleic acid having at least 
80% nucleic acid sequence identity to a nucleotide sequence that encodes an amino acid 
-sequence selected-from the group consisting of the amino.acid sequence shown in (SEQ 
ID NO: 2), (SEQ ID NO: 4), (SEQ ID NO: 6) and (SEQ ID NO: 8). 

In other aspects, the invention provides isolated nucleic acid having at least 
80% nucleic acid sequence identity to a nucleotide sequence selected from the group 
20 consisting of the nucleotide sequence shown in (SEQ ID NO: 1), (SEQ ID NO: 3), 
(SEQ ID NO: 5) and (SEQ ID NO: 7). 

In another aspect, the invention provides a vector comprising the nucleic 

acid of the above-aspects. 

The invention also provides an isolated polypeptide the amino acid 
25 sequence of which comprises residues 1-18, 19-518, 451-530, 601-680, 751-830, 
800-1010, or 1196-1215 of SEQ ID NO: 2; 1-23, 24-507 or 1100-1119 of SEQ ID 
NO: 4; or 1-42, 43-600, 541-620, 691-770, 831-910, 900-1110 or 1270-1293 of SEQ 
ID NO: 6; or 8-49, 154-199 or 1-199 of SEQ ID NO: 8. 

In another aspect, the invention provides an isolated polypeptide having at 
least 80% amino acid sequence identity to an amino acid sequence selected from the 
group consisting of the amino acid sequence shown in (SEQ ID NO: 2), (SEQ ID NO: 
4), (SEQ ID NO: 6) and (SEQ ID NO: 8) 
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The invention also provides a chimeric molecule comprising a polypeptide of 
the above aspects fused to a heterologous amino acid sequence. In one embodiment 
the heterologous amino acid sequence is a Fc region of an immunoglobulin. 
In other, aspects, the invention provides an antibody that specifically 
5 binds to the polypeptides of the above aspects. 

The invention also provides a method of treating, suppressing or altering a 
disorder involving aberrant immune regulation involving a signaling pathway between a 
plexin and a neuropilin in a mammal comprising the step of administering an effective 
amount of an agent to said mammal capable of interfering with the association between 
10 the plexin and neuropilin. Contemplated agents include a chimeric molecule or an 
antibody of the above aspects. 

DESCRIPTION OF THE DRAWINGS 

Figure 1. 

(A) Phylogenetic tree of human plexins. Known family members cluster in two 

15 major groups: plexin A and plexin-B subfamilies. (B) Structural features of plexins, 

Met-Iike receptors and semaphorins. In the extracellular moieties, yellow boxes indicate 
the "semcT domains and blue boxes mark the cysteine-rich MRS motifs, some of which 
are stippled to indicate their atypical sequence; atypical MRS are also found in the sema 
domain of semaphorins. Sequence identity among sema domains ranges from 15-50%, 

20 as previously described (see Winberg et al., 1998 supra). Potential furin-like 

proteolytic sites are marked by red ribbons. plexin-B 1 "truncated" is the product of a 
splicing variant (see text). plexin-Dl and plexin-Cl (VESPR) are more distant family 
members, since they include atypical -features in theirextracellular domains. The 
intracellular domain of plexins (SP domain) is highly conserved through all family 

25 members, and it includes two separate regions of high homology (Maestrini, E., 

Tamagnone, T., Longati, P., Cremona, O., Gulisano, M., Bione, S., Tamanini, F., Neel, 
B.G., Toniolo, D., and Comoglio, P.M. (1996) "A family of transmembrane proteins 
with homology to the MET-hepatocyte growth factor receptor." Proc. Natl. Acad. Sci. 
USA 93, 674-678) (green oval and box). Met-like receptors'are disulfi de-bound 

30 heterodimers and include a cytoplasmic tyrosine kinase domain (red box). Mammalian 
semaphorins can be either secreted or cell surface proteins. Molecular weights of 
" * "representative proteins are Plexin- A 1 220 kDa, Plfexin-B 1 250 kDa, Plexin^Cl 200 
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kDa, HGF-R/Met 145+45 kDa (heterodimer), Sema 4D 150 kDa (transmembrane), 
Sema7A approx. 100 kDa (membrane GPI-linked). 
Figure 2. 

(a) Cell surface semaphores specifically bind human plexins. Micropraphs of 
the binding assays done testing i) the extracellular domain of semaphorin CD100 fused 
to alkaline phosphatase (CD100-AP) on COS cells transfected with plexin-Bl cDNA; 
ii) control AP on plexin-Bl; hi) CD100-AP on plexin-B2; iv) CD100-AP on the entire 
extracellular domain of plexin-Bl; v) CD100-AP on isolated "plexin-Bl truncated" 
(including sema domain, 1° and 2° MRS); vi) CD100-AP on a "plexin-Bl -Asema" 
(including 2° and S° MRS-; -vii) extracellular domain of semaphorin A39R fused to AP, 
on plexin-Cl (Vespr); viii) SemaKl-AP on plexin-Cl . The final detection of the 
binding was done either using alkaline phosphatase substrates (i-vi) or by 
immunofluorescence (vii and viii). (B) Scatchard analysis and binding curve of 
CD100-AP to plexin-Bl (K D = 0.9 nM ± 0.15). 

15 Figure 3. 

. Plexins ^^^j^lth_neuiomUns via specific extracellular domains. Western 
blots of immunoprecipitated samples from cells co-expressing neuropilins and plexins. 
Specific MoAbs were used, directed against the VSV-tag included in plexins or the 
myc-tag included in neuropilin-2 (Np2, 130 kDa). Np2 co-immunoprecipitates with 
20 plexins, such as plexin-A3 (220 kDa), the extracellular domain of plexinAl 

(approx.160 kDa), and plexin-Bl (250 kDa) but not with the unrelated cell surface 
receptor DCC (170 kDa). Np2 can also associate a truncated form of the extracellular 
moiety of plexin-Bl ("plex-Bl trunc.'\ approx. 1 10 kDa), containing the sema domain. 
Figure 4. 

Expression of mRNAs for plexins Al (panel A, B), -A2 (panel C, D) and A3 
(panel E, F) in the spinal cord (sc), dorsal root ganglia (d) and sympathetic ganglia (sg) 
of E13.5 mouse embryos. Expression of the mRNAs was detected by RNA in situ 
hybridization. Scale bar.l /mi. 
Figure 5. 

Effect of a truncated plexin-Al construct (lacking the cytoplasmic domain) on 
repulsive and attractive responses of Xenopus spinal neurons to Sema3A and netrin-1 . 
(A-F) A control spinal neuron exposed to a gradient of Sema3A emanating from a 
pipette (A) is repelled away over a period of 1 hr (B). In contrast, a GFP-expressing 
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spinal neuron from an embryo injected with mRNA for the truncated plexin-Al 
construct (C) is not affected by Sema3 A (D). A similar neuron (E) shows a normal 
attractive response to netrin-1 (F). 

(G) Cumulative distribution plot of turning angles for all the neurons studied. 
5 Curves show the percent of neurons with turning angles less than the angle indicated on 
the abscissa, under different conditions (open circles, control neurons; black and blue 
circles, control neurons responding to Sema3A or netrin-1, respectively; red and green 
circles, responses of neurons expressing the truncated plexin-Al construct to Sema3A 
and netrin-1, respectively. (H) Mean turning angle under all the conditions just 
10 mentioned. — 
Figure 6. 

Tyrosine phosphorylation of plexin-A3 and plexin-B 1. (a) Anti- 
phosphotyrosine western blotting of immunoprecipitated p220 plex ~ A3 and p250 plex 81 
proteins. plexin-B 1 is larger since it contains an extra sequence between the second and 

15 the third MRS motif, in the extracellular domain (see Fig. 1). (b) The same 
immunoprecipitated samples underwent in v/rrakinase assay in the_presence of 
[y 32 P] ATP , Mg** and Mn^ ions. The SDS-PAGE was treated with alkali in conditions 
known to eliminate the phosphate labeling of Ser/Thr residues and specifically 
preserving phosphotyrosines. 

20 Figure 7 

Plexin-A3 overexpression mediates cell repelling cues, (a) Epithelial kidney 
MDCK cells transfected to overexpress plexin-A3 (or mock transfected) were 
cocultured with mesenchymal KJ-29 or NIH-3T3 cells. After 16-30 hours, mixed - 
cultures of control cells (upper panels) reached confluency and stopped growing: 

25 typically the epithelial cells formed islets (circled) surrounded by a fibroblasts lawn. In 
contrast, MDCKs overexpressing plexin-A3 (lower panels) overwhelmed the adjacent 
mesenchymal cells. The latter withdrew and selectively detached from the culture dish 
(dying cell clusters are indicated by arrowheads), and eventually epithelial cells only 
survived. To allow an easier detection of the mesenchymal cells, these were labeled 

30 with Dil before being plated in mixed cultures, (b) Plexin-A3 expressing cells do not 
induce apoptotic signal on repelled fibroblasts. Mixed cultures of NIH 3T3 and control 
orplexin-A3 overexpressing MDCKs were tested for the presence of TUNEL-AP 
positive cells. Apoptotic cells were not present in clusters of repelled cells (indicated by 
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arrows). The right panel shows a positive control where apoptosis was induced on the 
same cells by UV treatment, (c) Plexin-A3 over-expressing cells form very transient 
contacts with fibroblasts. Time-lapse video-microscopy of control and plexin-A3 
overexposing MDCK cells grown in presence of fibroblasts. On top, snap-shot 

5 images from the movie, taken every 50 minutes (real time). In the upper row is shown 
the persistent contact of a fibroblast (marked by an arrow) with an islet of control 
MDCK cells (marked by a star). In the lower row another fibroblast, instead, forms a 
transient contact with an islet of plexin-A3 transfected cells, which also, in turn, 
reshapes. At the bottom, the diagrams show the relative frequency of persistent, 

10 transient or very transient contacts between fibroblasts and MDCK cells. 

TYFTATTTTX) DESCRIPTION OF THE I NVENTION 
The reference works, patents, patent applications, and scientific literature, 
including accession numbers to GenBank database sequences, that are referred to herein 
establish the knowledge of those with skill in the art and are hereby incorporated by 

15 reference in their entirety to the same extent as if each was specifically and individually 
in dicated to be inc orporated by rrference. Any conflict between any reference cited 
herein and the specific teachings of this specification shall be resolved in favor of the 
later. Likewise, any conflict between an art-understood definition of a word or phrase 
and a definition of the word or phrase as specifically taught in this specification shall be 

20 resolved in favor of the latter. 

Four novel human plexins have been identified: plexin-B2, plexin-B3, plexin-Dl 
and Plexin A-4. Plexin-A4 is located on human chromosome 7 and is a family member 
of the plexin-A subfamily which also includes plexin-Al (alternatively named plexin- 
1/NOV), plexin-A2 (alternatively named plexin-2/OCT) and plexin-A3 (alternatively 
named plexin-2/SEX). Plexin-B2 and plexin-B3 are located on human chromosome 22 
and chromosome X, respectively, and are family members of the plexin-B subfamily 
which also includes plexin-Bl (alternatively named SEP). Plexin-B3 maps very close 
to the plexin-A3 genomic locus on Xq28. Plexin-Dl is the first identified member of 
the plexin-D subfamily and is atypical of any of the other subfamilies. A fourth 
subfamily of plexins, the plexin-C subfamily, is defined by VESPR (now plexin-Cl). 

The four novel plexins as described herein have in their extracellular domains 
regions of homology^ two other protein. families; (a) Scatter Factors Receptors, 
encoded by the MET oncogene family (Tamagnone and Comoglio, 1997 supra), and (b) 
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Semaphorins (Kolodkin et al. (1993) supra (Figure lb). In particular, plexins and Met- 
Iike receptors contain short cysteine-rich motifs, termed "Met Related Sequences" 
(MRS), whose minimal consensuses: C-X(5-6>C-X(2>C^X(6.8)-C-X(2)-C-X(3-5)-C 
(Maestrihi et al., 1996 i supra); Tamagnone and Comoglio, 1997 supra); blue boxes in 
5 Fig. IB). The proteins of the Met family contain a single MRS (in their receptor (3 
chains), whereas in plexin family members there are two/three repeated MRS motifs. 
Furthermore, all plexin family members contain in their extracellular moiety a 500 
amino acid region similar to the sema domain of semaphorins (Kolodkin et al. (1993) 
supra; Winberg et al., 1998 supra); yellow boxes in Fig. IB. The MRS motif is 
10 ' proposed to function as protein-protein interaction domain. 

The cytoplasmic domain of plexins contains a -600 amino acid domain which we 
term the SP domain ("Sex and Plexins", marked in green in Fig. IB) that is highly 
conserved within the family (57-97% similarity) and in evolution (over 50% similarity 
between invertebrates and humans). The SP domain does not share homology with any 
15 known protein. It includes a number of potential tyrosine phosphorylation sites, but 
lacks-the typical jnotifs of catalytic tyrosine kinases. Inte rest ingly, the pr edicte d 
secondary structure of the SP domain includes long conserved alpha helices, typically 
found in protein-protein interaction modules. Furthermore, there are several 
dihydrophobic amino acid motifs (such as LL or LI), known to mediate the 
20 internalization and downregulation of transmembrane receptors (Sandoval, IV. and 
Bakke O. (1994). Targeting of membrane proteins to endosomes and lysosomes. 
. Trends in Cell Biology 4, 292-297). 

The present invention also demonstrates that plexins can form complexes with 
neuropilins, which in. turn demonstrates that a receptor for semaphorins can be hetero- 
25 oligomers of plexins and neuropilins. As demonstrated by in situ mRNA expression 
analysis, plexins and neuropilins are in fact simultaneously expressed in several 
neuronal populations during embryonic development. The plexin-neuropilin complex 
predates ligand binding, since the association is not influenced by the presence of class 
3 semaphorins. That the observed plexin-neuropilin complexes are formed in cis is. 
30 furthermore supported by the experimental conditions used (cotransfection of isolated 
cells with the two constructs). An interaction in trans might also be envisioned 
(considering that plexins and semaphorins share similar sema domains), however by , 
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analyzing mixed cultures of cells separately transfected with plexins and neuropils we 
did not isolate associated complexes (data not shown). 

We observed that the main semaphorin binding domain of neuropilins (CUB 
domain (Giger, R.J., Urquhart, E.R., Gillespie, S.K., Levengood, D.V., Ginty, D.D., and 
Kolodkin, A.L. (1998) "Neuropilin-2 is a receptor for semaphorin IV: insight into the 
structural basis of receptor function and specificity." Neuron 21, 1079-1092; 
Nakamura, F., Tanaka, M, Takahashi, T., Kalb, R.G., and Strittmatter, S.M. (1998) 
"Neuropilin-1 extracellular domains mediate semaphorin D/ffl-induced growth cone 
collapse" [In Process Citation]. Neuron 21, 1093-1100; Chen et al. 1998 supra) is not 
required for the interaction with plexins, as indicated by the association of the relevant 
Neuropilin-2 deletion construct with plexin-Bl (not shown). A ternary complex, where 
neuropilins use two distinct protein modules to form a bridge between the sema domain 
of semaphores and the sema domain of plexins is thus contemplated. It is further 
contemplated that plexins are the functional partners of neuropilins, required for 
transducing signals mediated by semaphoring preferably class 3 semaphores. For 
example, in flies, which lack both, neuropilins and class 3 semaphorins, D Plex A 
appears sufficient as a functional receptor for Sema la, a transmembrane class 1 
semaphorin (Winberg et al., 1998 supra). Further support that plexins are functional co- 
receptors for secreted semaphorins is demonstrated in an experiment that shows that a 
truncated plexin-Al construct expressed in Xenopus spinal neurons abolishes repulsive 
responses to Sema3A without markedly affecting attractive responses to netrin-1. These 
results are consistent with the involvement of plexins. 

The intracellular signals transduced by plexins are still largely obscure. The 
cytoplasmic domain of plexins is large and highly conserved within and across species 
and contains stretches of alpha helices, which are putative protein-protein interaction 
domains, and could thus mediate the association with cytosolic partners. We 
demonstrate herein that the cytoplasmic domain of plexins can be tyrosine 
phosphorylated, suggesting that, like other receptors devoid of intrinsic catalytic 
activity, plexins may signal by associating a tyrosine kinase (Stahl, N. and 
Yancopoulos, G.D. (1993). The alphas, betas, and kinases of cytokine receptor 
complexes. Cell 74, 587-590; Glass, D.J., Bowen, D.C., Stitt, T.N., Radziejewski, C, 
Bruno, J., Ryan, T.E., Gies, D R., Shah, S., Mattsson, K., Burden, 8* DiStefano, P.S., 
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Valenzuela, D.M., DeChiara, T.M., and Yancopoulos, G.D. (1996). Agrin acts via a 
MuSK receptor complex. Cell 85, 513-523) . 

. In addition, we show herein that expression of plexins, particularly plexin-A3, 
mediates cell-repelling cues. By time-lapse video-microscopy we observed a true 
5 repelling effect on fibroblasts. Intriguingly, we observed that -upon interaction with 
fibroblasts- also the islets of plexin-A3 MDCKs at times reshaped. This may be 
explained by the existence of intra-epithelial repelling cues, balanced by the attractive 
forces exerted by epithelial cell junctions. 

Moreover we have demonstrated that in the nervous system (i.e. Drosophila), that 
10 defasciculating motor axons co-express both Plexin A and one of its interacting 
) partners, the transmembrane semaphorin Sema-la (Winberg et ah, 1998 supra). This 

demonstrates that plexins act in vivo either as receptors or ligands for cell surface 
semaphorins, which in turn can transduce intracellular signals, as reported for ephrins 
(Holland et ah, 1996 supra), Semaphorins, therefore, besides being pivotal in axon 
15 guidance, have a general role in morphogenesis and tissue remodeling by mediating 
cell-repelling cues 

Accordingly, in a first aspect, the invention provides an isolated nucleic acid 
molecule encoding a novel human plexin polypeptide. By "plexin polypeptide" is meant 
a member of the plexin family comprising an amino acid sequence that shares at least 
20 60% amino acid sequence homology with SEQ ID NOS: 2 (plexin B-2), 4 (plexin B-3), 6 
(plexin D-l) or 8 (plexin A-4), preferably, at least 65% sequence homology, more 
) preferably, at least 70% sequence homology, more preferably, at least at least 75% 

sequence homology, more preferably, at least 80% sequence homology, still more 
preferably at least 85% sequence homology, even more preferably, at least 90% sequence 
25 homology, and most preferably at least 95% sequence homology with SEQ ID NOS: 2, 4, 
6 or 8. Plexin polypeptides of the invention are useful for modulating cell growth (i.e. 
nerve) and immune regulation. 

As used herein, by "modulating" is meant increasing or decreasing cell growth. 
By "cell growth" is meant any change in cell number or size, including, without 
30 limitation, increase or decrease in cell number, increase or decrease in rate of cell 

division, increase or decrease in rate of cell death, and/or increase or decrease in cell size, 
- — Standard methods fonmeasuring cell growth include standard apoptosis assays (e.g., 

TUNEL assays, DNA fragmentation, trypan blue exclusion) and cell proliferation assays 
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{e.g., 3 H-thymidine incorporation). It will be appreciated that the degree of modulation 
of cell growth provided by a plexin polypeptide in a given assay will vary, but one of 
skill in the art can readily determine the statistically significant change in cell growth of a 
cell exposed to a plexin polypeptide. 
5 By "immune regulation" is meant increasing or decreasing the biological 

functions of immune cells {i.e., cells involved in an immune response). Immune cells 
include, without limitation, lymphocytes (T and B), NK cells, dendritic cells, myeloid 
cells {e.g., macrophages and neutrophils), and other hematopoietic cells involved in an 
immune response. 

10 By "nucleic acid molecule" or "nucleic acid" as used herein, is meant any 

deoxyribonucleic acid (DNA) or ribonucleic acid (RNA), including, without limitation, 
complementary DNA (cDNA), genomic DNA, RNA, heteronuclear RNA (hnRNA), 
messenger RNA (mRNA), DNA/RNA hybrids, or synthetic nucleic acids {e.g., an 
oligonucleotide) comprising ribonucleic and/or deoxyribonucleic acids or synthetic 
15 variants thereof. The nucleic acid of the invention includes, without limitation, an 
oligonucleotide or a polynuc leotide^ The nucleic ^can^e single sttanded, or 
partially or completely double stranded (duplex). Duplex nucleic acids can be 
homoduplex or heteroduplex. 

By "polypeptide" is meant any molecule comprising two or more amino acids 
20 joined together with a peptide bone, regardless of length or post-translational 
modifications {e.g., without limitation, glycosylation, lipidation, acetylation, or 
phosphorylation). Useful plexin polypeptides of the invention include, without 
limitation, the full length plexin polypeptides having the amino acid sequence of SEQ 
ID NOS: 2, 4 , 6, 8 or 10,; an extracellular domain of the polypeptide having the amino 
25 acid sequence 1 to 1199 of SEQ ID NO: 2; 1 to 1099 of SEQ ID NO: 4; 1 to 1270 of 
SEQ ID NO: 6 or 1 to 199 of SEQ ID NO: 8, with its associated signal peptide; or an 
extracellular domain of the polypeptide having the amino acid sequence 19 to 1 1 99 of 
SEQ ID NO: 2; 24 to 1099 of SEQ ID NO: 4; or 43 to 1270 of SEQ ID NO: 6, lacking 
its associated signal peptide; an intracellular domain of the polypeptide having the 
30 amino acid sequence of SEQ ID NOS: 2, 4, 6 or 8; and polypeptides, the amino acid 
sequence of which comprises about residues 1-18 (putative signal sequence), 19- 

. _ 518 (sema domain), 451-530 (1° MRS), 601-680 (2° MRS), 751-830 (3° MRS), 800- 

1010 (G-P repeats) or 1196-1215 (putative transmembrane domain) of SEQ ID 
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NO: 2; about residues 1-23 (putative signal sequence), 24-507 518 (sema 
' domain) or 1100-1119 (putative transmembrane domain) of SEQ ID NO: 4; or 
about residues 1-42 (putative signal sequence), 43-600 (sema domain), 541-620 
(1° MRS), 691-770 (2° MRS), 831-910 (3° MRS), 900-1110 (G-P repeats) or 1270- 
5 1293 (putative transmembrane domain) of SEQ ID NO: 6; or about residue 8-49 
(1° MRS) or 154-199 (2° MRS) of SEQ ID NO: 8. 

By "isolated" is meant a compound (e.g., a nucleic acid molecule or a protein) 
that has been separated from components (e.g., nucleic acid molecules, proteins, lipids, 
and/or carbohydrates) which naturally accompany.it. Water, buffers, and other small 
"10 molecules (e.^ a molecular weight of lessthan about 1000 daltons) 

may accompany an isolated compound of the invention. Preferably, an isolated 
compound is at least 70%, by weight, free from components which naturally 
accompany it. More preferably, an isolated is at least 75%, by weight, free from 
components which naturally accompany it; still more preferably, at least 80%, by 
15 weight, free; even more preferably, at least 85%, by weight, free; and even more 

preferably, at least 90%, by weight, free from components which naturally accompany 
it. Most preferably, a substantially purified compound is at least 95%, by weight, free 
from components which naturally accompany it. 

Where the isolated compound is a nucleic acid molecule, the isolated nucleic acid 
20 molecule is separated from other nucleic acids (e.g., genes or transcripts) or proteins 
which, in the naturally-occurring genome of the organism from which the nucleic acid 
molecule was derived, flanked the nucleic acid molecule. Isolated nucleic acid 
molecules therefore include, without limitation, a recombinant nucleic acid molecule 
incorporated into a plasmid or other vector (e.g., a replication-defective virus); a 
25 recombinant nucleic acid molecule incorporated into the genome of a prokaryotic or 
eukaryotic organism; or a nucleic acid molecule which exists as a separate molecule 
independent of other nucleic acids (e.g., a PCR product, a chemically synthesized nucleic 
acid molecule, or a nucleic acid molecule produced by restriction endonuclease 
digestion). Purification of a nucleic acid molecule can be accomplished and measured by 
30 any standard method including, without limitation, sequence analysis, chemical • 
synthesis, FCR, CsCl gradient, phenol:chloroform extraction, ethanol precipitation, 
Southern or Northern blotting analysis followed by band extraction and purification, and 



WO 01/14420 



-13- 



PCT/USOO/23365 



high performance liquid chromatography (HPLC; see, e.g., Fisher (1980) Laboratory 
Teghni flues in Bjgc hermstry »nrf MolgcuJaiBiology,, Work and Burdon (eds.), Elsevier). 

Thus, in one non-limiting example, to obtain an isolated nucleic acid molecule 
encoding a plexin polypeptide, a nucleic acid molecule is chemically synthesized on a 

5 standard oligonucleotide synthesis machine. The resulting single stranded molecule is 
then subjected to second strand synthesis to form a duplex DNA molecule, which is then 
ligated into a plasmid capable of replication in a prokaryotic or eukaryotic cell. The 
nucleic acid molecule is then replicated in the cell, purified {e.g., by CsCl gradient), and 
subjected to sequence analysis. 

K) -In certain ^mhommen^^ the nucleic acid 

molecule has a nucleic acid sequence comprising SEQ ID NOS: 1, 3, 5, 7 or 9. 
Preferably, the nucleic acid molecule of the invention has not more than 500 nucleotides 
flanking each of the 5' and 3' ends of SEQ ID NOS: 1, 3, 5, 7 or 7. In certain 
embodiments, the plexin polypeptide has an amino acid sequence that comprises SEQ ID 

15 NOS: 2, 4, 6, 8 or 10. Preferably, the plexin polypeptide of the invention has not more 
than SOamino acid residues flanking each of the N-terminal and C-terminal ends of SEQ 

ID NOS: 2, 4, 6, 8 or 10. 

In certain embodiments of the first aspect of the invention, the nucleic acid 
molecule hybridizes under stringent conditions (as defined herein) to SEQ ID NOS: 1, 3, 
20 5, 7 or 9. 

The invention also includes nucleic acid molecules that hybridize under 
stringent hybridization conditions (as defined herein) to all or a portion of the 
nucleotide sequence represented by SEQ ID NOS: 1, 3, 5, 7 or 9 or its complement. The 
hybridizing portion of the hybridizing nucleic is at least 80%, e.g., at least 95%, or at 
25 least 98%, homologous to the sequence of a portion or all of a nucleic acid encoding a 
polypeptide having the amino acid sequence of SEQ ID NOS: 2, 4, 6, 8 or 10, or its 
complement. Hybridizing nucleic acids of the type described herein can be used, for 
example , as a cloning probe, a primer (e.g., a PCR primer) or a diagnostic probe. 

Hybridization of the oligonucleotide probe to a nucleic acid sample typically is 
30 performed under stringent conditions. Nucleic acid duplex or hybrid stability is 

exoressed as the melting temperature or Tm, which is the temperature at which a probe 
dissociates from a target DNA. This melting temperature is used to define the required 
stringency conditions. If sequences are to be identified that are related and substantially 
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identical, rather than identical, then it is useful to first establish the lowest temperature 
at which only homologous hybridization occurs with a particular concentration of salt 
(e.g., SSC or SSPE). Then, assuming that 1% mismatch results in a 1°C decrease in the 
Tm, the temperature of the final wash in the hybridization reaction is reduced 
accordingly (for example, if the sequences have > 95% identity with the probe are 
sought, the final wash temperature is decreased 5°C). In practice, the change in the Tm 
can be between 0.5 C and 1.5 C per 1% mismatch. "Stringent conditions" involve 
hybridization at 68°C in 5x SSC/5x Denhardt's solution/1.0% SDS, and washing in 
0.2xSSC/0.1% SDS at room temperature. "Moderately stringent conditions" include 
washing in 3xSSG at 42°C. The parameters of salt concentration and temperature can -- 
be varied to achieve the optimal level of identity between the probe and the target 
nucleic acid. Additional guidance regarding such conditions is readily available in the 
art, for example, by Sambrook et al. f supra] and Ausubel et al, supra. 

Nucleic acid sequence homology (as well as amino acid sequence homology) can 
be measured according to standard methods. Unless otherwise specified, as used herein 
used herein, "percent APIPology" of two amino acid sequences or of two nucleic acids is 
determined using the algorithm of Karlin and Altshul (Proc. Natl. Acad. Sci. USA 87: 
2264-2268, 1990), modified as in Karlin and Altschul (Proc. Natl. Acad. Sci. USA 90: 
5873-5877, 1993). Such an algorithm is incorporated into the NBLAST and XBLAST 
programs of Altschul et al. (J. Mol. Biol. 215: 403-410, 1990). BLAST nucleotide 
searches are performed with the NBLAST program, e (score) = 100, word length = 12, to 
obtain nucleotide sequences homologous to a nucleic acid molecule of the invention. 
BLAST protein searches are performed with the XBLAST programme (score) = 50, word 
length = 3, to obtain amino acid sequences homologous to a reference polypeptide (e.g., 
SEQ ID NO: 2). To obtain gapped alignments for comparison purposes, Gapped BLAST 
is utilized as described in Altschul et al. (Nucleic Acids Res. 25: 3389-3402, 1997). 
When utilizing BLAST and Gapped BLAST programs, the default parameters of the 
respective programs (e.g., XBLAST and NBLAST) are used, namely e=10; w=ll for 
nucleic acid; w=3 for amino acid (the Blosum 62 scoring matrix); low complexity 
sequence filtering. The default settings of BLAST emphasize regions of local alignment 
to detect relationships among sequences which share only isolated regions of similarity 
(Altschulet al., /. Mol Biol. 215s 403-410 (1990). See http;//www,ncbi;nlm.nih.gov . 
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Thus, in a non-limiting example to obtain an isolated nucleic acid molecule 
encoding a plexin polypeptide, a nucleic acid molecule having the sequence of SEQ ED 
NOS: 1, 3, 5 or 7 is used to probe a cDNA library under stringent conditions according to 
standard techniques (see., e.g., Ausubel et al, supra). Upon identification of a positive 
5 clone (i.e., a clone that hybridizes to SEQ ID NOS: 1, 3, 5 or 7 under stringent 

conditions), that clone is expanded and subjected to sequence analysis. A nucleic acid 
molecule having a nucleic acid sequence that is at least 70% identical, preferably at least 
75% identical, more preferably, at least 80% identical, still more preferably at least 85% 
identical, even more preferably, at least 90% identical, and most preferably at least 95% 
10 -identical (as measured by the basic BLAST-program of NCBI on default settings) to SEQ 
ID NOS: 1, 3, 5 or 7 is a nucleic acid molecule of the invention. 

In a second aspect, the invention provides four novel isolated plexin polypeptides. 
"Isolated" is as defined for the first aspect of the invention. Where the isolated 
compound is a polypeptide, the isolated polypeptide is separated from organic molecules, 
15 such as nucleic acid molecules, polypeptides, and/or carbohydrates, which, in the 

naturally-occurring .organism from which the polypeptide was derived, accompany the 
polypeptide. Isolated polypeptides therefore also include a recombinant polypeptide 
(e.g., a human polypeptide expressed in an insect cell), or a chemically synthesized 
polypeptide. Purification of a polypeptide can be accomplished and measured by any 
20 standard method including, without limitation, chemical synthesis, recombinant 
polypeptide expression in prokaryotic or eukaryotic cells, affinity chromatography, 
Western blotting analysis, SDS-PAGE analysis, and/or HPLC. 

In accordance with this aspect, the invention provides all derivatives, mutants, 
truncations, and/or splice variants of the four novel plexin polypeptides, so long as these 
derivatives, mutants, truncations, and/or splice variants share at least 60% amino acid 
sequence homology with SEQ ID NOS: 2 ,4 ,6 or 8, preferably, at least 65% sequence 
homology, more preferably, at least 70% sequence homology, more preferably, at least at 
least 75% sequence homology, more preferably, at least 80% sequence homology, still 
more preferably at least 85% sequence homology, even more preferably, at least 90% 
30 sequence homology, and most preferably at least 95% sequence homology with SEQ ID 
NOS: 2 ,4 ,6 or 8 as determined using the basic BLAST program of the National Center 
for Biotechnology (NCBI; National Library of Medicine, Bethesda, MD),. using the 
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default settings defined therein using the sequence of the four novel plexin derivative, 
mutant, truncation and/or splice variance as the probe. 

Preferred plexin polypeptide derivatives include polypeptides whose sequences 
differ from the sequence given in SEQ ID NOS: 2 ,4 ,6 or 8, by one or more 
5 conservative amino acid substitutions, or by one or more non-conservative amino acid 
substitutions, deletions or insertions which do not abolish the biological activity of the 
plexins. Conservative amino acid substitutions typically include the substitution of one 
amino acid for another with similar biochemical characteristics, such as polarity, size, 
and/or charge. Non-limiting examples of conservative substitutions are substitutions 
10 within the following groups: valine, glycine, glycine, alanine, valine, isoleucine,- 

leucine; aspartic acid, glutamic acid, asparagine, glutamine; serine, threonine; lysine, 
arginine, phenylalanine, and tyrosine. 

Useful methods for mutagenesis to generate plexin mutants are known in the art 
(see, e.g., Sambrook et aL, supra', Ausubel et aL, supra). Preferred methods for 
15 mutagenesis are described in PCT Publication W099/12965 and include PGR 

mutagenesis and saturation mutagenesis. _A library of random amino acid sequence 
variants can also be generated by the synthesis of a set of degenerate oligonucleotide 
sequences. 

In certain embodiments of the second aspect of the invention, the plexin 
20 polypeptide has a sequence comprising the sequence of SEQ ID NOS: 2 ,4 ,6 or 8. In 
one non-limiting example, in accordance with the invention, an isolated plexin 
polypeptide comprising the sequence of SEQ ID NOS: 2 ,4 ,6 or 8 can chemically 
synthesized according to standard techniques (e.g., at a commercial peptide generating 
facility). 

25 For example, a putative plexin polypeptide is purified and subjected to N- 

terminal sequencing to determine its amino acid sequence. The amino acid sequence of 
the polypeptide is then compared to SEQ ID NOS: 2 ,4 ,6, 8 or 10 (as measured by the 
basic BLAST program of NCBI on default settings). A polypeptide that shares at least 
60% homology with SEQ ID NOS: 2 ,4 ,6, 8 or 10 is a plexin polypeptide of the 

30 invention. 

In another example, purification of a plexin polypeptide is facilitated by the 
addition of a tag to the polypeptide that enables purification of the taggedvpolypeptide. 
Non-limiting examples of a tag include a hemagglutinin (HA) tag, a his tag, a GST tag, a 
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FLAG-tag, and a myc tag. Thus, a nucleic acid molecule of the first aspect is engineered 
using standard molecular biology techniques to incoiporate the nucleic acid sequence 
encoding the tag. The engineered nucleic acid molecule is then introduced and 
positioned for expression in an appropriate cell to produce the recombinant tagged 

5 polypeptide, which can then be readily purified by binding of the tag to its substrate. For 
example, the his tag binds to Ni-NTA agarose. Likewise, a GST (glutathione S- 
transferase) tag binds to glutathione agarose beads. Both his tag and GST tag expression 
and purification kits are commercially available from PharMingen (San Diego, CA). 
Likewise, myc-tagged plexin polypeptide are produced by cells introduced with a nucleic 

10 acid molecule encoding the tagged protein and positioned for expression in the cell. 

It will be appreciated that particularly useful polypeptides of this aspect of the 
invention are secreted by the cell in which they are produced, thus facilitating 
purification of the polypeptide from the culture media in which the cells have been 
maintained, without requiring lysis of the cell. 

15 In a third aspect, the invention provides a cell engineered to comprise a nucleic 

acid molecule enco_dingjine of die four p±exmj>cjyp^^^ meant 
that the cell of the invention has been modified by standard molecular biology 
techniques. Where the cell is "engineered to comprise a nucleic acid molecule," standard 
molecular biology techniques have been employed to introduce the indicated nucleic acid 

20 molecule into the cell, either by transformation or transfection of the cell with a plasmid, 
or by infection or transduction of the cell with a recombinant virus. 

The nucleic acid molecule of the first aspect of the invention is preferably 
subcloned into a plasmid or vector (for example, but not limited to, a vector used to 
generate a recombinant virus), wherein the nucleic acid molecule is positioned for 

25 expression in the plasmid or vector. The plasmid or vector is then introduced into a cell 
by standard techniques to produce an engineered cell in accordance with the third aspect 
of the invention. 

In certain embodiments of the third aspect, the cell is a prokaryotic cell {e.g., a 
bacterium). For example, E. coli cells {e.g., DH5a) are transformed (using, e.g., 
30 electroporation) with a bacterial plasmid {i.e., a plasmid containing an E. coli origin of 
replication) containing a nucleic acid molecule of the first aspect of the invention. The 
transformed bacteria are selected using; for example, an antibiotic-resistance encoding 
nucleic acid molecule {e.g., ampicillin resistance) on the plasmid such that the antibiotic 
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resistance protein is expressed by the transformed bacteria. The transformed bacteria are 
then propagated, and can be cryopreserved and stored frozen in glycerol. 

Those of skill in the art will appreciate that in accordance with the third aspect of 
the invention, a nucleic acid molecule encoding one of the four plexin polypeptides may 
5 be introduced into a large variety of cells. For example, a nucleic acid molecule 
encoding one of the four plexin polypeptides can be introduced into prokayotic cells 
(e.g., bacteria), and any eukaryotic cell into which an exogenous nucleic acid molecule 
may be introduced. Thus, in certain embodiments of the third aspect of the invention, the 
cell is a eukaryotic cell. Eukaryotic cells according to this aspect of the invention that 
10 comprise a nucleic acid molecule encoding one of thefour plexin polypeptides include, 
without limitation, yeast cells, plant cells, insect cells, and mammalian cells. Within the 
category of mammalian cells are cells from any mammalian species (including, without 
limitation, mouse, hamster, monkey, human), of any lineage (including, without 
limitation, lymphocyte, fibroblast, stem cell), and may be an immortalized cell, or a non- 
15 immortalized cell. Cells, as well as plasmids and/or vectors (e.g., vectors that can be 
packaged to form infectious virus particles), are commercially available, for example, 
from the American Type Culture Collection ("ATCC"; Manassas, VA). 

In certain embodiments of the third aspect of the invention, the nucleic acid 
molecule is positioned for expression in the cell. By "positioned for expression" is 
20 meant that the nucleic acid molecule is operably linked to at least one regulatory 

sequence which directs the transcription and translation of the nucleic acid molecule in a 
cell, such that the eel! engineered to comprise the nucleic acid molecule produces (i.e., 
expresses) the protein encoded by the nucleic acid molecule. By "operably linked" is 
meant that the nucleic acid molecule and the regulatory sequence are connected in a such 
25 a way as to permit expression of the nucleic acid molecule when the nucleic acid 
molecule is present in a cell. Regulatory sequences include, without limitation, 
promoters, enhancers, IRES sequences, and polyadenylation signals. Since piexin 
polypeptides are involved in immune regulation and the modulation of cell growth, it 
may be .desirable to operably link a nucleic acid molecule encoding one of the four plexin, 
30 polypeptides to an inducible promoter. 

The four plexin polypeptides that are encoded by the nucleic acid molecules do 

-not necessarily include the transmembrane-domain of the four. plexin polypeptides, and. 

so may be produced by the cell as an intracellular polypeptide or a soluble secreted 
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polypeptide. For example, if the polypeptide fragment is secreted by the cell, it can be 
purified from the conditioned growth media of the transfected cells, without having to 
lyse the cells. Likewise, although a soluble intracellular polypeptide fragment is purified 
from only lysed cells, the fragment, being soluble, does not have to be extracted from the 

5 cell membrane; thus, different lysis conditions may be used to obtain purified soluble 
intracellular polypeptide fragment as compared to the lysis conditions required to obtain 
purified full length plexin polypeptides (which has a transmembrane domain). 

Protein expression systems have been established for a variety of cells and are 
known to those of skill in the art. Cells are also commercially available from the ATCC, 

10 and a variety of protein expression system kits are commercially available from, for 
example, Invitrogen Corp. (Carlsbad, CA), Clontech Laboratories (Palo Alto, CA), 
PharMingen (San Diego, CA), Promega Corp. (Madison, WI), and Stratagene (La Jolla, 
CA). 

For example, a nucleic acid molecule encoding one of the four plexin 
15 polypeptides is operably linked to bacterial regulatory sequences (e.g., T7 late promoter 
or bacteriophage regulatory ;^uctcm)^^ AejesuWi^.nr^c acid molecule is used to 
transform bacterial cells, where the transformed bacterial cells produce one of the four 
plexin polypeptides. In another example, a nucleic acid molecule encoding one of the 
four plexin polypeptides is operably linked to baculovirus regulatory sequences in a 
20 baculovirus vector. Recombinant baculovirus are then generated and used to transduce 
insect cells (using, for example, the expression kit commercially available from Clontech 
Laboratories. The transduced insect cells comprise a nucleic acid molecule encoding one 
of the four plexin polypeptides positioned for expression in the insect cell, and thus 
produce one of the four plexin polypeptides. 

Mammalian cells are widely used as protein expression systems. For example, a 
mammalian cell may be transduced with a recombinant retrovirus or adenovirus 
comprising a nucleic acid molecule encoding one of the four plexin polypeptides 
operably linked to regulatory sequences that are either endogenous to the particular virus 
or exogenous to the virus (e.g., a CMV promoter in a retroviral vector). The transduced 
30 mammalian cell is then propagated in vitro in tissue culture, in vivo (e.g., in an 
immunocompromised animal), and/or cryopreserved and stored frozen in DMSO. 

In another example, mammalian cells^are transfected with . an expression plasmid 

comprising a nucleic acid molecule encoding one of the four plexin polypeptides 



25 



WO 01/14420 



-20- 



PCT/USOO/23365 



operably linked to one or more regulatory sequences on the plasmid. By "expression 
plasmid" is meant a plasmid in which an inserted nucleic acid molecule of interest {e.g., 
encoding one of the four plexin polypeptides, a plexin chimeric molecule, or tagged 
plexin polypeptide) is operably linked to at least one regulatory sequence such that when 
5 the expression plasmid containing the inserted nucleic acid molecule of interest is 
introduced {e.g., by transfection) into a cell, the inserted nucleic acid molecule is 
positioned for expression in that cell. The nucleic acid molecule in the expression 
plasmid, upon being introduced into the cell, is thus positioned for expression in that cell, 
and enables the cell to produce one of the four plexin polypeptides encoded by the 

-10 nucleic acid molecule. — - - - - - 

In one non-limiting example, a nucleic acid molecule according to the first aspect 
of the invention is inserted into a standard mammalian expression plasmid {e.g., 
pcDNA3.1 commercially available from Invitrogen Corp., Carlsbad, California), such 
that the inserted nucleic acid molecule encoding one of the four plexin polypeptides is 

15 operably linked to the regulatory sequences in the mammalian expression plasmid. 

Mammalian cells are then transfected with this expression plasmid (using, e.g., CaP0 4 or 
DEAE-dextran). Where the expression plasmid contains an antibiotic-resistance 
encoding nucleic acid molecule {e.g., neomycin resistance on the pCDNA3.1 plasmid) 
such that the antibiotic resistance protein is expressed by the transfected cells, transfected 

20 cells may be selected for the ability to grow in the presence of the antibiotic. The 
transfected cells may then be propagated and cryopreserved and stored in frozen in 
DMSO. 

In a fourth aspect, the invention provides an isolated nucleic acid molecule 
encoding a chimeric molecule comprising at least two segments, wherein one of the 
25 segments comprises one of the four plexin polypeptides. By "chimeric molecule" is 
meant a protein that comprises at least two segments of polypeptide joined together by 
any means, including, without limitation, a covalent bond {e.g., peptide bond), a non- 
covalent bond {e.g., ionic bond or hydrogen bond) or by a chemical crosslinker. It should 
be noted that one of the four plexin polypeptides that has been tagged is within the 
30 definition of a chimeric molecule. 

In certain embodiments of the fourth aspect of the invention, the nucleic acid 
. ...molecule encoding the segment pf. a chimeric molecule comprising one of the. four plexin 
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polypeptides hybridizes under stringent conditions to SEQ ID NO: 1, 3, 5 or 7. 
"Stringent conditions" are as described above for the first aspect of the invention. 

Standard molecular biology techniques may be employed to generate nucleic acid 
molecules encoding chimeric molecules according to the fourth aspect of the invention. 
For example, a nucleic acid molecule encoding the extracellular domain of one of the 
four plexin polypeptides may be joined, in frame, to a nucleic acid molecule encoding the 
constant region of an immunoglobulin molecule (see, e.g., Chamow S.M., Antibody 
Fusion Proteins, John Wiley & Sons, New York, 1999). By "in frame" is meant that a 
first nucleic acid molecule is ligated to a second nucleic acid molecule such that the each 
of the amino acid sequences of the polypeptides encoded by each of the first and the 
second nucleic acid molecules is not frame-shifted. 

In one non-limiting example, a chimeric molecule comprising the extracellular 
domain of one of the four plexin polypeptides including the amino acid sequence of SEQ 
ID NOS: 2, 4, 6 or 8 is generated. In this example, a nucleic acid molecule encodes 
amino acid residue number 1(19) through about amino acid residue number 1 199 of SEQ 
ID NO: 2; amino acid residue number 1(24) through about a m i n o acid residue number 
1099 of SEQ ID NO: 4; amino acid residue number 1(43) through about amino acid 
residue number 1270 of SEQ ID NO: 6 and amino acid residue number 1 through about 
amino acid residue number 199 of SEQ ID NO: 8 with its associated signal peptide 
(parenthesis depicts about the beginning of the amino acid sequence of the extracellular 
domain lacking its signal peptide). This nucleic acid molecule is fused in frame with a 
nucleic acid molecule encoding the constant region of an immunoglobulin, such that the 
chimeric molecule encoded by the resulting nucleic acid molecule generally has the 
following structure: 



N-terminus 


extracellular domain of SEQ ID 
NO: 2 with or lacking its signal 
peptide 


amino acids from the constant 
region of an Ig molecule 


C-terminuf 


N-terminus 


extracellular domain of SEQ ID 
NO: 4 with or lacking its signal 
peptide 


amino acids from the constant 
region of an Ig molecule 


C-terminu; 


N-terminus 


extracellular domain of SEQ ID 
NO: 6 with or lacking its signal 
peptide 


amino acids from the constant 
region of an Ig molecule 


C-terminu! 


N-terminus 


extracellular domain of SEQ ID. 
1 NO: 8 with or lacking its signal 


amino acids from the constant 
region of an Ig molecule 


C-terminu 
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| peptide j 

The heavy chain class {e.g., IgG, IgA, IgM, IgD, or IgE) can be. varied in this chimeric 
molecule depending upon which constant region is used. Nucleic acid molecules 
encoding the constant region of various immunoglobulin (Ig) heavy chains are known 
5 (see, e.g., Zettlmeiss] et al., DNA Cell Biol 9(5):347-53, 1990) Indeed, expression 
plasmids are available, into which the nucleic acid molecule of interest (Le., a nucleic 
acid molecule encoding an extracellular domain of the polypeptide of SEQ ID NO: 2; 
SEQ ID NO: 4; SEQ ID NO: 6; or SEQ ID NO: 8) can be inserted, and the resulting 
_plasmid introduced into a cell to produce oneiofthe four extracellular pl exin-I g chimeric 
10 molecule s (see, e.g., Zettlmeissl et al., supra; Miller et al., J. Exp. Med. 178 (1): 211- 
222, 1993). 

Any variety of chimeric molecule carrying the extracellular domains of one of the 
four plexin polypeptide may be generated. For example, the extracellular domain of one 
of the four plexin polypeptides can be myc-tagged, his-tagged, or FLAG tagged 

15 according to standard molecular biology techniques. 

Such extracellular proteins are particularly useful for identifying ligands to which 
the extracellular domain of one of the four plexin polypeptides bind. For example, 
extracellular plexin-Dl-Ig chimera can be immobilized on a protein A-sepharose column. 
Molecules suspected of binding the extracellular domain of plexin-Dl are added to the 

20 column, to which the molecule that binds to the extracellular domain of plexin-Dl 
adhere, and the non-binding molecules flow through the column. The extracellular 
plexin-Dl -binding molecules are readily eluted, for example, by changing the pH or ion 
concentration of the elution buffer. 

Extracellular plexin proteins are also used to identify cells expressing the ligand 

25 of plexin extracellular domain on their cell surface (and thereby also identify the ligand 
itself). For example, cells are incubated with a FLAG-tagged plexin extracellular domain 
chimeric molecule. A FLAG-tagged plexin extracellular domain chimeric molecule is 
generated. An anti-FLAG antibody that is detectably labeled is then added to the cells. 
By "detectably labeled" is meant any means for marking and identifying the presence of a 

30 molecule. Detectable labels include, without limitation, radioactive labels (e.g., 32 P or 
35 S) and fluorophpre labels (e.g., FTTC, phycoerythrin, or rhodamine). For example, 
FTTC-labeled anti-FLAG antibodies are commercially available from Babco, Richmond, 
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CA. The "stained" cells (i.e., cells incubated first with the FLAG-tagged plexin 
extracellular domain chimeric molecule and then with the FITC-labeled anti-FLAG 
antibody), are then subjected to flow cytometry analysis to select those cells that are 
labeled with FITC, and so express a molecule that binds to the extracellular domain of 
5 one of the four plexin polypeptides. The FITC labeled cells are then further manipulated 
(e.g., characterized to determine which cells express the plexin polypeptide ligand). 

The ligand of the plexin extracellular domain is itself identified, for example, by 
lysing the cells, adding the Iysate to one of the four plexin extracellular domain-Ig 
chimeric molecule columns described above, and purifying the ligand. The ligand is then 

40 sequenced by N-terminal -sequencing. - 

In another non-limiting example, the intracellular domain of one of the four 
plexin polypeptides is used as "bait" in a yeast two-hybrid system to identify ligands that 
interact with the intracellular domain of one of the four plexins described herein. For 
general description of the two-hybrid system, see U.S. Patent Nos. 5,283,173; 5,468,614; 

15 and 5,695,941. In this example, a nucleic acid molecule encoding from about amino acid 
residue number 143 through at least amino acid residue number 214 of SEQ ID NO: 2 is 
inserted into a standard DNA binding domain expression plasmid (e.g., the GAL4 DNA 
binding domain plasmid in the Interactor kit commercially available from PharMingen 
(San Diego, CA). (It will be understood that the nucleic molecule may encode amino 

20 acid residue number 138-148 through at least amino acid residue number 214 of SEQ ID 
NO: 2.) A variety of cDNA libraries in transcriptional activation domain vectors are 
available (e.g., from Clontech, Palo Alto CA). The cDNA libraries are screened 
employing standard methods (see, e.g., the methods employed in U.S. Patent No. 
5,780,262) to identify cDNA clones encoding a ligand that binds to the intracellular 

25 domain of one of the four plexin polypeptides. One preferable cDNA library screened in 
this example is a cDNA library generated from an immune cell (e.g., a lymphocyte or NK 
cell). 

In a fifth aspect, the invention provides a purified chimeric molecule comprising 
one of the four plexin polypeptides. Methods for purifying proteins are as described for 
30 the second aspect of the invention. 

In a sixth aspect, the invention provides a cell engineered to comprise a nucleic 
. acid molecule encoding a chimeric molecule comprising at least two segments, wherein ^ 
• one of the segments comprises one of the four plexin polypeptides. As described for the 
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third aspect of the invention, a nucleic acid encoding a chimeric molecule comprising 
one of the four plexin polypeptides may be introduced into any variety of cells. In certain 
embodiments, the cell is a prokaryotic cell or a eukaryotic cell. In certain embodiments, 
the eukaryotic cell is a yeast cell or a mammalian cell {e.g., a human cell). 
5 In a seventh aspect, the invention provides an isolated binding agent that 

specifically binds one of the four plexin polypeptides, or specifically binds a chimeric 
molecule comprising a segment comprising one of the four plexin polypeptides. In 
certain embodiments, the plexin protein has an amino acid sequence comprising SEQ ID 
NOS:2, 4, 6 or 8. 

. -10 By "specifically.binds is -meant a binding agent-O-g., an antibody) that-binds to 

its specific target (e.g., one of the four plexin polypeptides) with greater affinity than it 
binds to other molecules. Preferably, where the binding agent is an antibody, the 
antibody preferably specifically binds to its specific target with a dissociation constant 
(Kd) of at least 10" 5 M, more preferably, 10" 6 M, even more preferably 10" 7 M, and most 
15 preferably, the binding agent specifically binds to its specific target with a Kd of at least 
l<r 8 M. 

Preferably, the binding agent of this aspect of the invention is an antibody, such 
as a monoclonal antibody or a polyclonal antibody, or a fragment of an antibody that 
specifically binds one of the four plexin polypeptides. Standard methods may be 
20 employed to generated both monoclonal and polyclonal antibodies that specifically bind 
to one of the four plexin polypeptides of the invention. See, e.g., Ausubel et al., supra-, 
Coligah, J.E. et aL, Current Protocols in Immunology , John Wiley & Sons, New York 

■4 

(1991); and Delves, P. J., Antibody Production: Essential Techniques , John Wiley & 
Sons, New York (1997). Briefly, the plexin polypeptides of the present invention, 

25 purified according to the methods described for the second aspect of the invention, are 
used to immunize rabbits (e.g., for polyclonal antibodies) or mice (e.g., for monoclonal 
antibodies) to generate antibody-mediated immunity to the four plexin polypeptides used 
to immunize the animal. For monoclonal antibodies, antibodies can be screened by, e.g., 
ELISA, to identify those antibodies that show the highest affinity for the immunizing 

30 plexin protein of polypeptide fragment. The cloned cell producing the high affinity 

monoclonal antibody can then propagated in vitro (where the antibody is purified from 
the culture supernatant) or in vivo (where the antibody is purified from ascites fluid), and 
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can also be cryopreserved and stored frozen at, e.g., -70°C in DMSO, to provide a 
potentially limitless supply of monoclonal antibody. 

In addition to intact monoclonal and polyclonal antibodies, the invention also 
provides various antibody fragments, such as Fab, F(ab') 2 , Fv, and sFv fragments. 
5 Recombinant, chimeric, and humanized antibodies are also provided. 

Recombinant "humanized antibodies" which specifically bind to one of the four 
plexin polypeptides can be synthesized according to methods known in the art (see, e.g., 
Green L.L. et al., Nature Genetics 7: 13-21, 1994 for fully humanized antibodies 
expressed in transgenic animals; see also U.S. Patent Nos: 5,693,761; 5,777,085; and 

10 5,585,089). Humanized antibodies are chimeras comprising mostly human IgG 

sequences into which at least portions of the regions responsible for specific antigen- 
binding (e.g., CDR's) have been inserted. Animals are immunized with the desired 
antigen, the corresponding antibodies are isolated, and the portion of the variable region 
sequences responsible for specific antigen binding are removed. The animal-derived 

15 antigen binding regions are then cloned into the appropriate position of human antibody 
genes in which the antigen binding regions have been deleted. Humanized antibodies 
minimize the use of heterologous (Le., inter-species) sequences in human antibodies, 
and thus are less likely to elicit immune responses in the treated subject (see also, e.g., 
U.S. Patent No. 5,807,715). 

20 Construction of different classes of recombinant antibodies can also be 

accomplished by making chimeric or humanized antibodies comprising nonhuman 
variable domains and human constant domains (CHI, CH2, CH3) isolated from 
different classes of immunoglobulins. For example, antibodies with increased antigen 
binding site valencies can be recombinantly produced by cloning the antigen binding 

25 site into vectors carrying the human chain constant regions (see, e.g., Arulanandam et 
al,J. Exp. Med. 177: 1439-1450, 1993). 

In addition, standard recombinant DNA techniques can be used to alter the 
binding affinities of recombinant antibodies with their antigens by altering amino acid 
residues in the vicinity of the antigen binding sites. The antigen binding affinity of a 

30 humanized antibody can be increased by mutagenesis based on molecular modeling 
(see, e.g., Queen et al., Proc. Natl. Acad. Sci. 86: 10029-10033, 1989). 

Also provided in-the invention are plexin. polypep tide-specific single 
polypeptide chain antibodies (see general methods in U.S. Patent Nos. 4,946,788 and 
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4,704,692); single.domain antibodies (Ward, E.S. et ah, Nature 341: 544-546, 1989); 
and chimeric antibodies (U.S. Patent No. 4,816,567). 

Binding agents that specifically bind the plexin polypeptides of the present 
invention are useful, for example, in determining expression levels of the plexin 
polypeptides in various tissues of the body, Western blotting analysis, and 
immunochromatography. Particularly, binding agents that specifically bind the plexin 
polypeptides are useful for binding the plexin polypeptide on a cell expressing the 
plexin polypeptide, thereby activating the cell. 

A binding agent that specifically binds one of the four plexin polypeptides, for 
..example, _ is .effective, as .an immune modulator. Additional applications in clude , without 
limitation, an injectable formulation comprising a binding agent that specifically binds 
one of the four plexin polypeptides that is useful to antagonize activity in a disease 
involving aberrant immune regulation or a disease involving aberrant cell growth. 

In an eighth aspect, the invention provides an isolated antisense oligonucleotide 
complementary to a portion of a nucleic acid molecule encoding one of the four plexin 
polypeptides. In certain embodiments, hybridization of the antisense oligonucleotide to 
the nucleic acid molecule inhibits transcription or translation of the nucleic acid 
molecule. 

By two nucleic acid molecules being "complementary" to one another is meant 
that the first nucleic acid molecule {e.g., an oligonucleotide) is able to form Watson- 
Crick base pair hydrogen bonds (i.e., hybridize) with the second nucleic acid molecule to 
form a duplex. The first nucleic acid molecule is thus a "complement" of the second 
nucleic acid molecule. 

The antisense oligonucleotides according to the invention are complementary to a 
region of a nucleic acid molecule (or a region at the intron/exon boundary of DNA or 
RNA) that encodes one of the four plexin polypeptides. Preparation of antisense 
oligonucleotides is well known (see, e.g., Agrawal et al. 9 Trends Biotechnol. 10:152-158, 
1992; U.S. Patent No. 5,149,798; Agrawal et al y Proc. Natl Acad. Sci. USA 85:7079- 
7083, 1988; Froehler, Tetrahedron Lett. 27:5575-5578, 1986; and Bergot et al., J. 
Chromatog. 559:35-42, 1992. 

In a ninth aspect, the invention provides a method for identifying a nucleic acid 
molecule encoding one of the four plexin polypeptides, comprising contacting a pool of 
candidate nucleic acid molecules with a nucleic acid molecule encoding one of the four 
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plexin polypeptides, wherein hybridization of the nucleic acid molecule encoding one of 
the four plexin polypeptides under stringent conditions to a candidate nucleic acid 
molecule identifies the candidate nucleic acid molecule as a nucleic acid molecule that 
encodes one of the four plexin polypeptides. According to this aspect of the invention, 
5 "hybridization" and "stringent conditions" are as defined above for the first aspect of the 
invention. In certain embodiments, the nucleic acid molecule encoding one of the four 
plexin polypeptides has a nucleic acid sequence comprising SEQ ID NOS: 1, 3, 5 or 7. 

It will be understood that the isolated plexin polypeptides according to the second 
aspect of the invention, the plexin chimeric molecules according to the fifth aspect of the 

-10 in vention, and-bindi ng agents -that specifically bind the plexin polypeptides according to 
the seventh aspect of the invention, are useful as therapeutics to treat an individual 
suffering from, or suspected of having, a disease or disorder involving aberrant immune 
regulation or an individual suffering from, or suspected of having, a disease or disorder 
involving aberrant cell growth, particularly nerve cell growth. 

!5 By "disease or disorder involving aberrant immune regulation" is meant any 

disease or disorder in which an abnormal immune response is generated in response to 
either self or foreign antigens. Thus, this definition includes, without limitation, 
autoimmune diseases (e.g., lupus, inflammatory bowel disease, or Diabetes Type 1) and 
immunosuppressive diseases (e.g., multiple sclerosis or rheumatoid arthritis). 

20 By "disease or disorder involving aberrant cell growth" is meant any disease or 

disorder in which an abnormal amount of cell growth is observed. "Cell growth" is 
defined above. Thus, diseases and disorders involving aberrant cell growth include 
hyperplasia, neoplasia, and cancer, as well as degenerative diseases, such as 
neurodegenerative diseases. 

25 Preferable therapeutically useful plexin polypeptides are soluble polypeptides 

{e.g., lacking the hydrophobic transmembrane domain of the plexin polypeptides), 
particularly soluble polypeptide fragments that are secreted by the cell in which the 
fragment was produced. In a preferred embodiment the soluble plexin polypeptides are 
selected from the group consisting of plexin- A- 1 (Maestrini et al. 1996 supra), plexin- A- 

30 2 (Maestrini et al. 1996 supra), plexin-A-3 (Maestrini et al. 1996 supra), plexin-A-4, 

plexin-B-1 (Maestrini et al. 1996, supra), plexin-B-2, plexin-B-3, plexin-Cl (Comeau et 
al. 1998 supra), plexin-D-1 •• «-*r- - 
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In a tenth aspect, the invention provides a method for diagnosing a disease 
involving aberrant immune regulation or a disease involving aberrant cell growth, 
comprising comparing the amino acid sequence of one of the four plexin polypeptides 
from an individual suspected of having the disease with the amino acid sequence of one 
5 of the four plexin polypeptides from an unaffected individual, wherein the presence of a 
difference between the two amino acid sequences identifies the individual suspected of 
having the disease as having the disease. "Disease or disorder involving aberrant 
immune regulation" and "disease or disorder involving aberrant cell growth" are as 
defined above. 

-1-0 By "difference"Jn. the amino a^ polypeptides _ 

from an individual suspected of having the disease or disorder as compared with the 
amino acid sequence of one of the four plexin polypeptides from an unaffected 
individual, is meant any mutation that changes the amino acid sequence including 
substitution, deletion, of addition of one or more amino acid residues. 

15 Thus, in one nonlimiting example, one of the four plexin polypeptides is 

extracted from cells of an individual suspected of having a disease involving aberrant 
immune regulation (e.g., using an antibody according to the seventh aspect of the 
invention). The amino acid sequence of the plexin polypeptide is determined by N- 
terminal sequencing and compared to the amino acid sequence of one of the four plexin 

20 polypeptides from an unaffected individual (i.e., a normal individual of the same species 
that does not have a disease involving aberrant immune regulation or a disease involving 
aberrant cell growth). If there is a difference in the two amino acid sequences, the 
individual suspected of having a disease involving aberrant immune regulation is 
identified as having a disease involving aberrant immune regulation, and may be treated 

25 accordingly. 

In certain embodiments of the tenth aspect, the amino acid sequence of the plexin 
polypeptide from the unaffected individual comprises the sequence of SEQ ID NO: 2, 4, 
6, 8 or 10. 

The following examples are intended to further illustrate certain preferred 
30 embodiments of the invention and are not limiting in nature. Those skilled in the art 
will recognize, or be able to ascertain, using no more than routine experimentation, 
numerous equivalents to the specific substances and procedures described herein. Such 
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equivalents are considered to be within the scope of this invention, and are covered by 
the following claims. 

Practice of the present invention will employ, unless indicated otherwise, 
conventional techniques of cell biology, cell culture, molecular biology, microbiology, 
5 recombinant DNA, protein chemistry, and immunology, which are within the skill of 
the art. Such techniques are described in the literature. See, for example, Molecular 
Cloning: A Laboratory Manual, 2nd edition. (Sambrook, Fritsch and Maniatis, eds.), 
Cold Spring Harbor Laboratory Press, 1989; DNA Cloning, Volumes I and II p.N. 
Glover, ed), 1985; Oligonucleotide Synthesis, (M.J. Gait, ed.), 1984; U.S. Patent No. 
—10 4,683,195 (Mullis et al„>; Nueleie Acid Hybridization (-B.D. -Hames and S.J. Higgins, 
eds.), 1984; Transcription and Translation (B.D. Hames and S.J. Higgins, eds.), 1984; 
Culture of Animal Cells (R.I. Freshney, ed). Alan R. Liss, Inc., 1987; Immobilized 
Cells and Enzymes, IRL Press, 1986; A Practical Guide to Molecular Cloning (B. 
Perbal), 1984; Methods in Enzymology, Volumes 154 and 155 (Wu et ah, eds), 

15 Academic Press, New York; Gene Transfer Vectors for Mammalian Cells (J.H. 

Miller and M.P. Calps, eds.), 1987, Cold Spring Harbor Laboratory; Immunochemical 
Methods in Cell and Molecular Biology (Mayer and Walker, eds.), Academic Press, 
London, 1987; Handbook of Experiment Immunology, Volumes I IV (D.M. Weir and 
C.C. Blackwell, eds.), 1986; Manipulating the Mouse Embryo, Cold Spring Harbor 

20 Laboratory Press, 1986. 

The following Examples are provided to illustrate the present invention, and 
should not be construed as limiting thereof. 
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EXAMPLES 

Example 1 

Identification and cDNA cloning of plexins and sequence analysis 

Since the coding sequences of human plexin-Bl (SEP), plexin-A2(OCT) and 
plexin-Al(NOV) were incomplete, we obtained the missing cDNA by RT-PCR; primers 
were designed by homology to orthologous murine sequences and corresponding ESTs. 
Updated database entries are X87904, X87831 and X87832, respectively. Partial cDNA 
of plexin-A4 was obtained by assembling five overlapping human ESTs (HGI THC 
Report: THC203425), deriving from chromosome 7 specific cDNA pools. Another EST 

from chr. -7 (clone 7B19F10) eneodes the cytoplasmic domain of a plexin and — - 

presumably derives from the same gene as plexin-A4. Plexin-B2 cDNA was amplified 
by RT-PCR starting from the partial cDNA sequences of clones MM1 (Shinoura, N., 
Shamraj, O.I., Hugenholz, H., Zhu, J.G., McBlack, P., Warnick, R., Tew, J J., Wani, 
M.A., and Menon, A.G. (1995). Identification and partial sequence of a cDNA that is 
differentially expressed in human brain tumors. Cancer Lett 89, 215-221) and 
KLAA03 15 (Genbank database); the genomic locus of SEP-B was identified due to its 
100% sequence identity with clone C22_311 from human chromosome 22. Plexin-B3 
coding sequence was identified in the genomic sequence of ALD locus on human 
chromosome Xq28, using the algorithms HEXON and GENIE. Plexin-Dl was similarly 
found in the genomic sequence of chromosome 3 (pac pDJ70ill). The genomic 
sequence of plexin-Bl (SEP), in the region of the alternative splicing of the extracellular 
domain, was obtained using the following primers: sense 
5'GC AGCACCTGTGC ACCCAC A AGGC3 ' and antisense: 
5TGCAGGCTGG ACGGGAGGATGAGG3 ' . The common donor site is 
CCATCAG/gtgattgt (position 2028 from ATG); the alternative splice acceptor sites are: 
(i) cccccttcag/AGCCC, leading to the canonical plexin-Bl sequence, and (ii) 
ctcctctcag/GTGAT, leading to "plexin-Bl truncated" variant. All these new sequences 
were analyzed using the algoritms BLAST2, NETPHOS (phosphorylation prediction 
sites, by Nicolaj Blom), PH-PREDICT and Consensus Protein Secondary Structure 
prediction at IBCP. The phylogenetic tree was generated using A11A11 algorithm of the 
Darwin sequence analysis system (at CBRG). 

Example 2 
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Plexin cDNA expression constructs and protein analysis 

Cell transfections were carried out by Calcium phosphate and DEAE-dextran 
methods, using 5-10 ug of each cDNA (1-2 ug each in case of cotransfections). For 
transient transfections in COS and BOSC-23 cells the cDNA was cloned in pCDNA3 or 

5 derived expression plasmids (Invitrogen). MDCK stable transfectants for plexin-AS 
were obtained using pCEP4 expression plasmid (Invitrogen); the selection was done in 
the presence of Hygromicin-B (100-200 ug/ml). Plexin-A3 positive clones of MDCK 
cells were isolated from two independent transfections, and showed identical biological 
properties. Plexin and neuropilin expression constructs included a VSV- and myc-tag at 

10 the N' and C protein termini", respectively, detected by monoclonal antibodies anti- 
VSV-G (cat. V-5507, Sigma) and anti-cMyc-tag (cat. OP10-100UG, Calbiochem). 
"Plexin-Bl truncated" splice variant was expressed from a cDNA fragment isolated by 
RT-PCR and VSV-tagged at the N' terminus: the encoded amino acid sequence spans 
up to aa 676 (including the sema domain and two MRS motifs). "Plexin-Bl -sema" 

15 derives from a further deletion of the plexin-B 1 extracellular domain, and exclusively 

includes the sema domain. "PlexinrBl-Asema". protein mutantJncluiies.only the C* 

terminal half of plexin-B 1 extracellular domain, starting from amino acid 606, i.e. 
excluding sema domain and first MRS but including second and third MRS, 
transmembrane and intracellular domains. 

20 For immunoprecipitations, cells were lysed with EB buffer (20 mM Tris-HCl 

pH 7.4, 5 mM EDTA, 150 mM NaCl, 10% glycerol, 1 % Triton X-100), in the presence 
of a cocktail of protease inhibitors and ImM Na-ortovanadate. Immunoprecipitations 
were performed at 4DC for 4h with the appropriate antibodies; high stringency washes 
were performed, in the presence of 1 M LiCl. 

25 For in vitro kinase assays, immunopurified proteins were incubated with kinase 

buffer (50 mM Hepes, 100 uM DTT, 5 mM MnCl 2 , 5 mM MgCl 2 ) in the presence of 
redivue 5 uCi [y- 32 P] ATP (Amersham) for 10 minutes at 4°C in agitation. Samples 
were then submitted to SDS-PAGE and autoradiography, or analysed using a Phosphor- 
Imager system (Molecular Dynamics). Alkali treatment of the polyacrilamide gels was 

30 performed with 1M KOH for two hours at 55°C 

Western blots were performed according to standard methods. Specific 
detection of phospho-tyrosines was done with PY20 MoAb (Trasduction laboratories). 
Final detection was done with ECL system (Amersham). 
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Example 3 

Semaphorin-SEAP binding assays 

Soluble forms of Semaphorin extracellular domains were expressed as chimeric 
molecules with placental Secreted Alkaline Phosphatase (SEAP) and harvested from 
the conditioned media of transiently transfected COS or BOSC-23 cells. Serum-free 
media were concentrated over 100 times using Centricon Plus-20 filters (Millipore) 
with a molecular weight cutoff. of 100 kDa. The AP activity of these media was 
assessed as described (Flanagan, J.G. and Leder, P. (1990). The kit ligand: a cell surface 
molecule altered in steel mutant fibroblasts. Cell 63, 185-194); the specific activity of 
..chimeric molecules was approx. 10.00 U/mg.jConcentrated£emaph^ 
diluted as appropriate in a Hepes buffered saline, additioned with 0.2% BSA, 0.1% 
NaN 3 , 5 mM CaCl 2 and 1 mM MgCl 2 (HBSBA). For binding assays, COS cells 
transiently transfected with plexins were seeded on 48 well plates to reach confluence, 
and then incubated with Semaphorin-SEAP preparations (approx 1-5 nM) for 90 
minutes at room temperature. The binding was detected as described (Flanagan and 
Leder, 1990). Binding experiments with plexin-Cl/VESPR were as described (Comeau, 
M.R., Johnson, R„ DuBose, R.F., Petersen, M., Gearing, P., VandenBos, T., Park, L., 
Farrah, T., Buller, R.M., Cohen, J.L, Strockbine, L.D., Rauch, C, and Spriggs, M.K. 
(1998). A poxvirus-encoded semaphorin induces cytokine production from monocytes 
and binds to a novel cellular semaphorin receptor, VESPR. Immunity. 5, 473-482; He, 
Z. and Tessier-Lavigne, M. (1997). Neuropilin is a receptor for the axonal 
chemorepellent Semaphorin m. Cell 90, 739-751). 

For in vitro binding assays, plexin-Bl was purified from cell extracts by 
immunoprecipitation with anti-VSV antibody. Extracts of mock-transfected cells were 
used as control samples. After washing, the immunocomplexes were incubated with 
serial dilutions of CD100-SEAP (prepared as above) for 2 hours at 4°C, in continuous 
agitation. Samples were then washed 3 times with HBSBA and the bound alkaline 
phosphatase activity was measured by colorimetric assay using p-nitro-phenyl- 
phosphate, as described (Flanagan and Leder, 1990). Scatchard analysis was done using 
Equilibrate (by GertJan C. Veens tra). 
Example 4 

In situ hybridization analysis^ 
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RNA in situ hybridization was performed essentially as described (He, Z. and 
Tessier-Lavigne, M. (1997). Neuropilin is a receptor for the axonal chemorepellent 
Semaphorin m. Cell 90, 739-751). Briefly cDNA fragments of plexin-Al, -A2, and - 
A3 were used to generate 35 S-labeled antisense and sense RNA probes, which were 
used for in situ hybridization histochemistry of cryostat sections of rat embryos. 
Example 5 

Xenopus turning assay 

The methods for injecting mRNA encoding various constructs, and for studying 
the turning responses of the neurons, are exactly as described previously (Hong, K., 
Hinck, L., Nishiyama, M., Poo, M.M., Tessier=Lavigne, M.,-ancLStein, E. (1999). A _. 
ligand-gated association between cytoplasmic domains of UNC5 and DCC family 
receptors converts netrin-induced growth cone attraction to repulsion. Cell 97, 927- 
941); Song, H, Ming, G., He, Z., Lehmann, M, Tessier-Lavigne, M., and Poo, M. 
(1998). Conversion of neuronal growth cone responses from repulsion to attraction by 
15 cyclic nucleotides [see comments]. Science 281, 1515-1518). 
Example 6 

Mixed-culture assays and time-lapse videomicroscopy 

Mock-transfected and plexin-A3 overexpressing MDCK cells were seeded with 
mesenchymal cells (NM 3T3, KJ29, D17, among others), in multiwell culture plates by 
1 :4 or 1:1 ratio. NIH and KJ-29 cells were sometimes labeled by addition of DU (1,1'- 
dioctadecyl-3,3,3',3'-tetramethylindocarbocyanine perchlorate, Fluka) in the culture 
medium, 4 hours before harvesting for the assay; clusters of cells marked with this dye 
are marked in blue (in light microscopy) and emit red epifluorecence (TRITC filter). 
The repelling effect was observed 16-30 hours after confluency, by contrast phase 
microscopy using Leica DM IL. The progress of the assays was also monitored by time- 
lapse video-microscopy (320 minutes recording were converted into 1 minute play). To 
determine the time-length of cell contacts, for each assay, randomly chosen fibroblasts 
were followed during several hours and the duration of each contact between their 
lamellipodia and MDCK cells was measured. The doubling time of cells and their 
viability during the assay could also be analyzed, and no differences were observed in 
presence of control or plexin-A3 expressing cells. Substrate adhesion of plexin-A3 
overexpressing MDCKs was analyzed by counting attached cells after 30 minutes from 
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seeding on micro-wells coated with fibronectin, collagen or polylysin, in the absence of 
calf serum: no differences versus control cells were observed. 
Example 7 
Apoptosis detection 

5 TUNEL reaction (Boehringer detection kit) was performed on mixed coltures of 

MDCK and NIH3T3 cells, 24 hours after seeding in a 24- well culture plate. The 
labeling was converted into a colorimetric signal for analysis by light microscopy using 
the TUNEL-AP detection kit (Boehringer). As a positive control for the induction of 
apoptosis, the same cells were treated with UV-C (50 yJ/cm 2 ) or ljxM staurosporin. 

10 """"Although the foregoing invention has been described in- some detail by way of 

illustration and example for purposes of clarity of understanding, it will be apparent to 
those skilled in the art that certain changes and modifications will be practiced. 
Therefore, the description and examples should not be construed as limiting the scope 
of the invention, which is delineated by the appended claims. 

15 Example 8 

Plexins are specific receptors for cell surface semaphores in vertebrates 

Plexin-Cl (VESPR) has been shown to bind the soluble viral semaphores Sema 
VA and VB (Comeau et al., 1998 supra), and we recently found that Drosophila Piexin 
A (D Plex A) interacts with transmembrane Sema la (Winberg et al., 1998 supra). We 

20 therefore examined in vertebrates whether the extracellular domain of several different 
cellular semaphorins -fused to alkaline phosphatase- could bind members of the human 
plexin-A, -B and -C subfamilies. Multiple secreted semaphorins of class 3 (Sema3A, 
Sema 3C or Sema3F; see below) did not interact with plexins-Al, -A2, -A3, -Bl, B2, or 
-CI (data not shown). In contrast, plexin-Cl(Vespr) specifically bound Sema7A(Sema- 

25 Kl) (Fig. 2a), a GPI-membrane linked semaphorin (class 7). This result is not entirely 
unexpected, since Sema7A may represent the cellular counterpart of viral semaphorin 
SemaVB, previously shown to interact with this piexin (Comeau et al., 1998 supra). 
More interestingly, the class 4 transmembrane semaphorin Sema4D (CD100) did 
interact strongly and specifically with plexin-B 1 (Fig. 2a). Thus the prototypes of two 

30 distinct piexin families are the receptors for members of two distinct semaphorin sub- 
classes. We also found that Sema7A and Sema4D do net bind to neurcpilin-1 or -2 
— - atone, nor did co-transfection of either neuropilin witbplexin-B 1 significantly modify 
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its binding efficiency (not shown). Neuropilins thus seem so far to function as receptors 
only for vertebrate semaphorins of class 3. 

The affinity constant of Sema4D for plexin-Bl was estimated by Scatchard plot 
to be in the subnanomolar range (K D = 0.9 nM, Fig. 2b; the estimated K D of Sema7A 
5 for plexin-Cl is 2.1 nM, not shown). These values are consistent with those observed 
for semaphorins-neuropilins, and fly semaphorin 1-Plexin A interactions (He and 
Tessier-Lavigne, 1997 swpraWinberg et al., 1998). 

We used two deletion constructs of plexin-Bl to explore the semaphorin 
binding sites of plexins. Neither the N-terminal half of plexin-Bl extracellular domain 
10 ("plexin-Bl truncated", see previous paragraph), nor its C-terminal half ("plexin-Bl- 
Asema", see Experimental Procedures) was sufficient alone to bind CD 100 (see Fig. 
2a), suggesting that the binding of Sema4D depends on multiple structural determinants 
of the extracellular domain of plexin-Bl. 
Example 9 

15 Plexins associate with class 3 Semaphorin receptors, N europilins 

As outlined_abov-e^secreted semaphorins_of subclass 3 are_known-to hind 
neuropilins (He and Tessier-Lavigne, 1997 supra; Kolodkin et al., 1997 supra; Chen, 
R, Chedotal, A., He, Z., Goodman, C.S., and Tessier-Lavigne, M. (1997) "Neuropilin- 
2, a novel member of the neuropilin family, is a high affinity receptor for the 

20 semaphorins Sema E and Sema IV but not Sema EL" Neuron 19, 547-559). However, 
the short cytoplasmic tail of neuropilins seems to be dispensable for their biological 
activity (Nakamura, et al (1998) supra), indicating the requirement of an associated co- 
receptor for signal transduction. Interestingly, in Drosophila (where neuropilins have 
not been identified to date) Plexin A is sufficient to mediate the biological response to 

25 semaphorin- 1 in axon guidance (Winberg et al., l99Ssupra). 

In an initial set of experiments, we could not observe binding of the class 3 
semaphorins Sema3A(Sema EI), Sema3C(Sema E) or Sema3F(Sema IV) to plexins-Al, 
A2, A3, Bl, B2 or CI (not shown). To test whether plexins might be coreceptors with 
neuropilins for class 3 semaphorins, we set up co-precipitation experiments in COS 

30 cells to test whether neuropilins may interact with plexins. Three tested plexins (plexin- 
Al, -A3 and -Bl) associated both with neuropilin-2 (Np2, shown in Fig. 3) and 
neuropilin- 1 (not shown). The binding was specific, inasmuch as neither neuropilin nor 
any plexins coimmunoprecipitated with the netrin receptor DCC, under conditions 
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where DCC coimmunoprecipitated with the other netrin receptor UNC5H2 (Fig. 3 and 
data not shown). We observed finally that the plexin-neuropilin association is mediated 
by the sema domain of plexins, as demonstrated using either the "plexin-Bl truncated" 
splice variant (Figure 3) or an even shorter form of the extracellular domain ("plexin- 
5 Bl-sema", see Experimental procedures, not shown). 

To further support the idea of a plexin-neuropilin multimeric receptor complex 
for semaphorins, we show here that plexin-A3 (e.g.) is expressed in a large number of 
neuronal classes, including sensory, sympathetic, motor, and olfactory bulb neurons 
(Figure 4 and data not shown), which are known to respond to class 3 semaphorins, and 

-10 which express either neuropHin-1 or neuropilin-2 or both (Ghen et al.,-1997 supra; 

. ^ Feiner, L., Koppel, A.M., Kobayashi, H., and Raper, J.A. (1997). Secreted chick 

semaphorins bind recombinant neuropilin with similar affinities but bind different 
subsets of neurons in situ. Neuron 79, 539-545; He and Tessier-Lavigne, 1997 supra\ 
Kolodkin et al., 1997 supra). Thus, plexin-A3 is a candidate for a physiological 
15 coreceptor involved in mediating class 3 semaphorin effects on these axons. Other 

plexins may also have a role as neuropilin coreceptors in specific cell populations, such 
as plexin-A2, which is expressed in a subset of sensory neurons and in dorsal horn 
cells, and plexin-Al, which is expressed at low levels and broadly in the spinal cord 
(Figure 4). 

20 To directly test the possible involvement of plexins in class 3 semaphorin signal 

transduction, we studied the repulsive responses of Xenopus spinal neurons to Sema3A, 
) which is mediated by a receptor mechanism involving neuropilin- 1 (Song et al., 1998 

supra). We asked whether these responses could be altered by expression of a 
presumed dominant-negative plexin-Al construct lacking the cytoplasmic domain of 

25 the protein. Transmembrane proteins can be reliably expressed in these neurons by 
injecting the encoding mRNA at the developmental two cell stage, allowing the 
embryos to grow to tadpole stage, and then removing the spinal cord and culturing the 
neurons (Hong et al., 1999 supra). We therefore injected the mRNA encoding the 
truncated plexin-Al construct, together with mRNA encoding GFP (as a reporter) and 

30 then studied the responses of spinal neurons expressing GFP that were derived from 
these embryos. Whereas control spinal neurons are repelled by Sema3 A (Figure 5A, B 
- - . and Song et al. 1998 supra)*. neurons from embryos injected with mRNA for truncated ..... 

plexin-Al did not respond with either repulsion or attraction to Sema3A (Figure 5C, 
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D). This blocking effect appeared to be specific, since expression of a different 
heterologous receptor, UNC5H2, did not impair repulsion by Sema3A (Hong et aL, 
1997 supra), and since expression of the truncated plexin construct did not block 
attractive responses to netrin-1 (Figure 5E, F). Figure 5G, H quantifies these effects. 
5 As can be seen, the effect of Sema3A is completely abolished by the truncated plexin; 
although there is a slight apparent decrease in the attractive effect of netrin-1 the effect 
is not statistically significant. 

Although we have used a truncated plexin-Al construct, this construct may be 
expected to interfere with the function of various plexins, since all the plexins tested 
10 (Al, A3 andBl) associated with neuropilin-1. These results support a role for one or 

more plexins in mediating the repulsive Sema3A signal in the Xenopus spinal neurons. ) 
Example 10 

Plexins signal via a novel type of tyrosine phosphorylated cytoplasmic domain 

The sequences of plexin cytoplasmic domains are highly conserved among 

15 plexins but do not match any known sequences. We found that the plexin-A3 and 

plexin-B 1 proteins are phosphorylated on, tyrosine residues when joyerexpressed in 

human kidney cells (BOSC-23), as demonstrated using anti-phosphotyrosine antibodies 
(Fig. 6a). Furthermore, after immunoprecipitation and in vitro kinase assays, plexin-A3 
and plexin-B 1 became phosphorylated (Fig. 6b). Resistance to an alkali treatment (see 

20 Experimental procedures) confirmed the specific phosphorylation of tyrosine residues. 

The cytoplasmic domains of several receptors, including Met proteins, become 
tyrosine phosphorylated owing to an intrinsic kinase activity (Ullrich, A. and j 
Schlessinger, J. (1990) "Signal transduction by receptors with tyrosine kinase activity." 
Cell 61 y 203-212). Since the cytoplasmic domain of plexins is not similar to any bona 

25 fide or atypical tyrosine kinase, this suggests that a distinct tyrosine kinase co- 

immunoprecipitates in association with plexins, and is responsible for their tyrosine 
phosphorylation. Although some additional phosphorylated proteins can be found 
specifically with plexin-A3 and -Bl, we have not as yet identified this associated 
kinase. A number of endogenously expressed tyrosine kinases, namely Met, Ron, Abl 

30 and Src, were not found associated with plexin-A3 by immunoprecipitation and 

Western blotting (not shown). Since tyrosine phosphorylated residues often function as 
docking sites for intracellular signal transducers (Cantley, L.C., Auger, K.R., Caipenter, 
C, Duckworth, B., Graziani, A., Kapeller, R., and Soltoff, S. (1991) "Oncogenes and 
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signal transduction." Cell 64, 281-302), the fact that the cytoplasmic domains of 
plexins are tyrosine phosphorylated further suggests that they are part of signaling 
complexes. 
Example 1 1 

5 . Plexin-A3 expressing cells induce repulsion of co-cultured cells 

Stable transfectants expressing recombinant human plexin-A3 were successfully 
obtained in four different cell lines: IMR32 and AF8 (human neuroblasts), and BOSC- 
23 and MDCK (human and canine kidney cells, respectively). We observed modest 
phenotypic changes in the transfected cells, which generally become flatter and larger in 

- -10 size. The growth-rate of plexin- A3 overexpressing cells-was comparable Jx> parental 

lines and we did not observe differences in the ability to adhere on different substrates 
(data not shown). 

In keeping with previous report on the related Plexin of Xenopus laevis (Ohta, 
K., Mizutani, A., Kawakami, A., Murakami, Y., Kasuya, Y., Takagi, S., Tanaka, H., 

15 and Fujisawa, H. (1995). "Plexin: a novel neuronal cell surface molecule that mediates 
cell adhesion via a homophilic binding mechanism in the presence of calcium ions." 
Neuron 14, 1189-1199), we observed a modest increase in calcium-dependent 
homotypic cell aggregation of plexin- A3 transfectants (not shown). Surprisingly, we 
found that epithelial MDCK cells overexpressing plexin- A3 mediate strong repelling 

20 cues for adjacent cells. This was observed by co-culturing mock-transfected and plexin- 
A3 overexpressing MDCK cells together with several non-epithelial cell lines (such as 
NIH3T3, Kj29, and D17; Fig. 7A). Mock MDCKs grew alongside mesenchymal cells 
until confluency, when both cell types stopped proliferating. In contrast, when plexin- 
A3-overexpressing epithelial cells were grown in the same conditions, the adjacent 

25 mesenchymal cells withdrew from them, and ultimately detached from the plate. 

To analyze the dynamics of this repulsion process, we monitored for 36 hours, by 
time-lapse video-microscopy, mixed cultures of transfected MDCK cells and 
fibroblasts, in a number of independent experiments. At low cell density, fibroblasts 
showed intrinsic motility, exploring the surface of the plate with long lamellipodia and 

30 filopodia, and thus coming in contacts with a high number of stationary MDCK islets. 
The time-length of the contacts between fibroblasts and control MDCK cells varied 
from 30 minutes to several hours, lasting mostly over 100 minutes. However, when 
fibroblasts were cultured with MDCK cells overexpressing plexin-A3, transient 
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contacts were observed, often lasting less than 30 minutes (see Fig. 7C). At higher cell 
density, fibroblasts stopped and clustered alongside the islands of control MDCKs, 
whereas they kept moving in a hectic fashion between the islands of plexin-A3 
transfected cells (data not shown). 
5 This cell-repelling effect is not due to the release of soluble factors, since 

exchanging conditioned media between mixed cultures was without effect (not shown). 
Moreover, the two different cell populations grew normally until they came into 
contact, indicating that the repelling effect requires cell-cell interaction. To rule out the 
possibility that plexin-A3 expressing cells generate an apoptotic signal for fibroblasts, 

10 we monitored cell viability and apoptosis by T-UNEL staining. As shown in Figure 7B, 
the clusters of repelled fibroblasts did not include apoptotic cells; furthermore, the 
detaching cells still excluded Trypan blue stain and were able to spread again on a new 
culture plate (not shown). 

Taken together, these results demonstrate that in our experimental system, plexin- 

15 A3 mediates cell repelling cues, presumably by interacting with surface bound ligands 
on opposing cells. We could not identify -so far- the specific ligand for plexin-A3, 
however we propose that this may be a transmembrane semaphorin. It should be noted 
that the intracellular domains of transmembrane semaphorins, such as Sema4D, also 
include tyrosine residues, which may themselves become phosphorylated and associate 

20 with cytoplasmic signal transducer molecules, a property shown for ligands of the 
ephrin family (Holland, S.J., Gale, N.W., Mbamalu, G., Yancopoulos, G.D., 
Henkemeyer, M., and Pawson, T. (1996) "Bidirectional signalling through the EPH- 
family receptor Nuk and its transmembrane ligands." Nature 383, 722-725). 
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What is claimed is: 

1. Isolated nucleic acid having at least 80% nucleic acid sequence identity to a 
nucleotide sequence that encodes an amino, acid sequence selected from the 
group consisting of the amino acid sequence shown in (SEQ ID NO: 2 
(plexin B-2)), (SEQ ID NO: 4 (plexin B-3», (SEQ ID NO: 6 (plexin D-l)) 
and (SEQ ID NO: 8 (plexin A-4)) 

2. Isolated nucleic acid having at least 80% nucleic acid sequence identity to a 
nucleotide sequence selected from the group consisting of the nucleotide 

- - sequence shown (SEQ ID NO: 1 (plexin B-2)), (SEQ -ID NO: 3 (plexin -B- - 
3)), (SEQ ID NO: 5 (plexin D-l)) and (SEQ ID NO: 7 (plexin A-4)). 

3. A vector comprising the nucleic acid of any one of claims 1 or 2. 

4. An isolated polypeptide having at least 80% amino acid sequence identity to 
an amino acid sequence selected from the group consisting of the amino acid 
sequence shown in (SEQ ID NO: 2 (plexin B-2)), (SEQ ID NO: 4 (plexin B- 

. 3))_,_ (SEQ roj^: 6Jplexin D-l)} and LSEQ JD NO: 8 Mexin A-4)). 

5. An isolated polypeptide having at least 80% amino acid sequence identity 
to: 

(a) the polypeptide shown in (SEQ ID NO: 2 (plexin B-2)), (SEQ ID 
NO: 4 (plexin B-3)), (SEQ ID NO: 6 (plexin D-l)) and (SEQ ID NO: 8 (plexin 
A-4)), lacking its associated signal peptide; 

(b) an extracellular domain of the polypeptide shown in (SEQ ID 
NO: 2 (plexin B-2)), (SEQ ID NO: 4 (plexin B-3)), (SEQ ID NO: 6 (plexin D- 
1)) and (SEQ ID NO: 8 (plexin A-4)), with its associated signal peptide; or 

(c) an extracellular domain of the polypeptide shown in (SEQ ID 
NO: 2 (plexin B-2)), (SEQ ID NO: 4 (plexin B-3)), (SEQ ID NO: 6 (plexin D- 
1)) and (SEQ ID NO: 8 (piexin A-4)), lacking its associated signal peptide. 
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6. A chimeric molecule comprising a polypeptide according to claim 4 or 5 
fused to a heterologous amino acid sequence. 

7. The chimeric molecule of claim 6, wherein the heterologous amino acid 
sequence is a Fc region of an immunoglobulin. 

5 8. An antibody that specifically binds to a polypeptide according to claim 4 or 

5. 

9. The antibody according to claim 8, wherein the antibody is a monoclonal, a 
humanized antibody or a single-chain antibody. 

10. A method of suppressing or altering aberrant cell growth involving a 
10 signaling pathway between a plexin and a neuropilin in a mammal 

comprising the step of administering an effective amount of an agent to said 
mammal capable of interfering with the association between the plexin and 
neuropilin 

1 1. A method of treating, suppressing or altering a disorder involving aberrant 
15 immune regulation involving a signaling pathway between a plexin and a 

neuropilin in a mammal comprising the step of administering an effective 
amount of an agent to said mammal capable of interfering with the 
association between the plexin and neuropilin. 

12. The method according to claim 10 or 11 wherein said agent is a chimeric 
20 molecule according to claim 6 or 7. 

13. The method according to claim 10 or 11, wherein said agent is an antibody 
according to claim 8 or 9. 

14. A method of diagnosing or screening for tumors in a subject characterized 
by the expression profiles of the polypeptides according to claim 4 or 5 
wherein the expression profile of the polypeptides is different in a non- 
tumor sample as compared to the expression profile of the polypeptides m a 
tumor sample. 
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NIH-3T3 NIH-3T3 




Omin 50 min 100 min 150 min 




control Plexin-A3 



Persistent contacts Transient contacts Very transient contacts 
(over 1 00 min) (less than 1 00 min) (less than 30 min) 
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SEQUENCE LISTING 

<110> University of Torino 

<12 0> Novel Plexins and Uses Thereof 



<130> A077PCT 

<140> Not assigned yet 
<141> 2000-08-25 

<150> 60/150576 
<151> 1999-08-25 

<160> 12 

<17 0> FastSEQ for Windows Version 4.0 



<210> 1 

..--c211-> -6252 — 

<212> DNA 

<213> HOMO SAPIEN 



<400> 1 
gcggggggca 
tgccagcctg 
ggctgtggat 
ggatgcgaag 
gtgcacgccg 
ccagctgctg 
gggcatctgc 
cagcggggag 
gagctccacg 
cgacaacggc 
tgaagcctac 
gttcgtggcg 
gcacccggcc 
ctcctacctg 
cacctgcctg 
cagcagagac 
caaggtgcac 
ccgtgacatc 
ctccagcaag 
cgggctcaga 
cgccgccgag 
ggtgtacctc 
caagagagtc 
ccaggacaag 
gtgccgcgac 
gaaggccgag 
ctgcgtggcc 
gcagctgacc 
tggggagtcg 



atggcactgc 
aggccccgca 
gaggcctcag 
ctgcagctgg 
cccatcgagg 
ctgctcgacc 
gctctgcgcg 
aagtctttcg 
ggtcctggtg 
-atcatcgtga- 
acggaccacg 
gccttcgagg 
cggaaccgca 
gagatggacc 
gccgcctccg 
agccggagca 
gccaagatgg 
ttctacaagc 
agcttcccat 
ggcacagccg 
aacaaccaca 
accccagatg 
aagcgcgacc 
gtgttccggc 
tcccaggacc 
tgtccgcggg 
gtcaccagcg 
gtcagccccc 
ccgccacacc 



agctctgggc 

agctggactt 

gcgtggtgta 

agcagcaggt 

ccagccagtg 

ctcccaggaa 

ccctgagcaa 

tggccagcaa 

gtgaccgcgt 

gcactcggct- 

ccacctacaa 

acggccccta 

cgctgctggc 

tgcagtgccg 

tggctgcgcc 

gtggggggcc 

aggccaaccg 

ccttccacgg 

gtggctcgga 

tgctgcagcg 

ctgttgcttt 

gcacctcctc 

tggtactgtc 

tgccggtgca 

cctactgcgg 

ccgaggaggc 

cccagccaca 

tccctgccct 

ccgcccgcgt 



cctgaccctg 
cttccgcagc 
cctgggggcg 
ggccacgggc 
ccatgaggct 
gcgcctggtg 
catctccctc 
tgatgagggc 
gctgtttgtg 
gttggaccgg 
ggccggctac 
cgtcttcttt 
acgcatgtgc 
ggaccccgac 
tggctctggc 
cggtgcgggc 
caacgcctgt 
cgatatccag 
gcacctgccc 
tggaggcctg 
tctgggcacc 
agagtacgac 
tggagacctg 
ggagtgcctg 
ctggtgcgtc 
cagccactgg 
gaacatgagc 
gagcgaggag 
ggagggcgag 



ctgggcctgc 
gagaaagagc 
gtgaatgccc 
ccggccctgg 
gagatgactg 
gagtgcggca 
cgcctgttct 
gtggccacag 
ggcaaaggca 
actgacagca 
ctgtccacca 
gtcttcaacc 
agagaagacc 
atccacgccg 
agggtgctat 
ctctgcctgt 
tacacaggca 
tgcggcggcc 
tacccgctgg 
aacctcacgg 
tctgatggcc 
tctatccttg 
ggcagcctgt 
agctacccga 
gtcgagggac 
ctgtggagcc 
cggcgggccc 
gacgagttgc 
gccgtcatct 



tgggcgcagg 

tgaaccacct 

tctaccagct 

acaacaagaa 

acaatgtcaa 

gcctcttcaa 

acgaggacgg 

tggggctggt 

atgggccaca 

-gggaggcctt- 

acacacagca- 

agcaggacaa 

ccaactacta 

ctgcctttgg 

atgctgtctt 

tcccgctgga 

cccgggaggc 

acgcgccggg 

gcagccgcga 

ccgtgacggt 

ggatcctcaa 

tggagataaa 

acgccatgac 

cctgcaccca 

gatgcacccg 

gaagcaagtc 

agggggaggt 

tgtgcctttt 

gcaactcccc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
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aagcagcatc cccgccacac cgccaggcca ggaccacgtg gccgtgacca tccagctcct 1800 

cctcagacga ggcaacatcc tcctcacgtc ctaccagtac cccttctacg actgccgcca 1860 

ggccatgagc ctggaggaga acctgccgtg catctcctgc gtgagcaacc gctggacctg 1920 

ccagcgggac ctgcgctacc acgagtgccg ggaggcttcg cccaaccctg aggacggcat 1980 

cgtccgcgcc cacatggagg acagctgtcc ccagttcctg ggacccagcc ccctggtgat 2040 

ccccacgaac cacgagacag atgtgaact.t ccagggcaag aacctggaca ccgtgaaggg 2100 

ttcctccctg cacgcgggca gtgacttgct caagttcatg gagccggtga ccatgcagga 2160 

acccgggacc ttcgcctttc ggaccccaaa gctgtcccac gatgccaacg agacgctgcc 2220 

cccgcccctc tacgtcaagt cttacggcaa gaatatcgac agcaagcccc atgtgaccct 2280 

ccacaactgc tcccttggcc gcagcgactg cagcctgtgc cgggccgcta accccgacta 2340 

caggtcrgcg tggngcgggg gccagagcag gtgcgtgtat gaggccctgt gcaacaccac 2400 

ccccgagtgc ccgccgcccg tcatcaccag gacccagcct gagacgggcc ccctgggtgg 2460 

gggcacccgc .atcaccatcc tggggtccaa ttcgggcgtc caagcagggg acatccagag 2520 

garctccgtg gccggccgga actgctcctt tcagccggaa cgttactccg tgtccacccg 2580 

gauccwgtgt gtgatcgagg ctgcggagac gcctttcacg gggggcgtcg aggtggacgt 2640 

ctccgcgaaa ctgggccgtt cgcctcccaa tgtccagttc accttccaac agcccaagcc 2700 

tczcag-gtg gagccgcagc agggaccgca ggcgggcggc accacactga ccatccacgg 2760 

cacccacctg gacacgggct cccaggagga cgtgcgggtg accctcaacg gcgtcccgtg 2820 

taaagzgacg aagtttgggg cgcagctcca gtgtgtcact ggcccccagg cgacacgggg 2880 

ccagacgctt ctggaggtcc cctacggggg gtcccccgtg cccaaccccg gcatcttctt 2940 

cacctaccgc gaaa'accccg tactgcgagc cttcgagccg ctacgaagct ttgccagtgg 3000 

. _:tggccccagc .ajtcaacgtca. cgggtcaggg c.ttcagccig .atccagagg.t .ttgcca_t^gx 3.06.0 

ggccaccgcg gagcccctgc agtcctggca gccgccgcgg gaggctgaat ccc tgcagcc 312 0 

cacgacggtg gtgggtacag actacgtgtt ccacaatgac accaaggtcg tcttcctgtc 3180 

cccggc-gcg cctgaggagc cagaggccta caacctcacg gtgctgatcg agatggacgg 3240 

gcaccgigcc ctgctcagaa cagaggccgg ggccttcgag tacgtgcctg accccacctt 3300 

tgagaactcc acaggtggcg tcaagaagca ggtcaacaag ctcacccacg cccggggcac 33 60 

caacccgaac aaggcgatga cgctgcagga ggccgaggcc ttcgtgggtg ccgagcgctg 3420 

caccargaag acgctgacgg agaccgacct gtactgtgag cccccggagg tgcagccccc 3480 

gcccaagcgg cggcagaaac gagacaccac acacaacctg cccgagttca ttgtgaagtt 3540 

cggctczcgc gagtgggtgc tgggccgcgt ggagtacgac acacgggtga gcgacgtgcc 3600 

gcrcaccctc atcttgccgc tggtcatcgt gcccatggtg gtcgtcatcg cggtgtctgt 3660 

ctactgctac tggaggaaga gccagcaggc cgaacgagag tatgagaaga tcaagtccca 3720 

gccggagggc ctggaggaga gcgtgcggga ccgctgcaag aaggaattca cagacctgat 3780 

garcgagatg gaggaccaga ccaacgacgt gcacgaggcc ggcatccccg tgctggacta 3840 

eaagacctac accgaccgcg tcttzcttcct gccctccaag gacggcgaca aggacgtgat 3900 

gaccaccggc aagctggaca tccctgagcc gcggcggccg gtggtggagc aggccctcta 3960 

ccagttctcc aacctgctga acagcaagtc tttcctcatc aatttcatcc acaccctgga 4020 

gaaccagcgg gagttctcgg cccgcgccaa ggtctacttc gcgtccctgc tgacggtggc 4080 

gccgcacggg aaactggagt actacacgga catcatgcac acgctcttcc tggagctcct 4140 

ggagcagtac gtggtggcca agaaccccaa gctgatgctg cgcaggtctg agactgtggt 4200 

ggagacgatg ctgtccaact ggatgtccat ctgcctgtac cagtacctca aggacagtgc 4260 

cggggagccc ctgtacaagc tcttcaaggc catcaaacat caggtggaaa agggcccggt 4320 

ggatgcggta cagaagaagg ccaagtacac tctcaacgac acggggctgc tgggggatga 4380 

tgrggagtac gcacccctga cggtgagcgt gatcgtgcag gacgagggag tggacgccat 4440 

cccggcgaag gtcctcaact gcgacaccat ctcccaggtc aaggagaaga tcattgacca 4500 

ggrgtaccgt. gggcagccct gctcctgctg gcccaggcca gacagcgtgg tcctggagtg 4560 

gcgtccgggc tccacagcgc agatcctgtc ggacctggac ctgacgtcac agcgggaggg 4620 

ccggtggaag cgcgtcaaca cccttatgca ctacaatgtc cgggatggag ccaccctcat 4680 

ccrgtccaag gtgggggtct cccagcagcc ggaggacagc cagcaggacc tgcctgggga 4740 

gcgccaegcc ctcctggagg aggagaaccg ggtgtggcac ctggtgcggc cgaccgacga 4800 

ggcggacgag ggcaagtcca agagaggcag cgtgaaagag aaggagcgga cgaaggccat 4860 

caccgagatc tacctgacgc ggctgctctc agtcaagggc acactgcagc agtttgtgga 4920 

caactccttc cagagcgtgc tggcgcctgg gcacgcggtg . ccacctgcag tcaagtactt 4980 

ctccgacttc ctggacgagc aggcagagaa gcacaacatc caggatgaag acaccatcca 5040 
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catctggaag 
cttcatcttt 
gaccttcatg 
caagctgctg 
ggggatccgg 
ttcccgggcg 
cacgcagaag 
gatgcagctg 
cctctgacct 
cccccagggg 
cccaagccgg 
tgagcaatac 
agaaccagca 
gccacgctgg 
ctaggttggg 
tatgtctgtc 
acctcaccgg 
acgtcaaagg 
tcctggtctg 
ttccaagacc 
tttttcactc 



acgaacagct 
gacgtgcatg 
gatgcctgca 
tacgccaagg 
cagatggtgc 
cacacggact 
tactatgacg 
gccttccgcc 
acaatctcca 
aggaggccga 
agtgggtgca 
cgccgggcac 
tcgggtgttc 
gggcagctgg 
cccctggggg 
tgtccaccac 
cctccccaag 
tcaagtgaga 
tcagacaggc 
accccacccc 
eg 



taccgctccg 
tccacgaggt 
cgcgcacgga 
agatctccac 
aggtcagega 
ccttgaacac 
agatcatcaa 
tgcagcagat 
gcgctgcctt 
gtggctgtgg 
gccggaaccc 
cgcaccagca 
actgtcgagt 
cctccgcctc 
atctgagggc 
gagatgggag 
ggtgccggca 
cgtaggtcaa 
tgecctagag 
tttttgtaaa 



gttctgggtg 
ggtggacgcc 
gcataagctg 
ctacaagaag 
ccaggacatg 
cctcgtggca 
tgccttggag 
tgccgctgca 
gggacatagg 
ctgaggcccc 
gcccagcgtc 
ccagccccag 
ctcgagtgat 
cgcccacgca 
ctgtggcccc 
gaggagaaaa 
ctctgggtgg 
gtcctacgtc 
ccccacccag 
tcttgttcat 



aacatcctca 
tcgctgtcag 
agecgegatt 
atggcggagg 
aacacacacc 
ctccaccagc 
gaggatcctg 
ctggagaaca 
tacctgaggt 
caccctcccc 
tagactgtag 
ccccagctcc 
ttgaaaatgt 
ccagcagccg 
cagggcaagt 
ageggtaega 
actcacggct 
ggggeccaga 
teegggggga 
tgtaaatcaa 



agaaccccca 
tcatcgcgca 
ctcccagcaa 
attactacaa 
tggcagagat. 
tctaccaata 
ccgcccagaa 
aggtcactga 
acctgagagc 
tggaacgege 
catcttcctc 
ctccggccgc 
gccttacgct 
cctccatgcc 
tcccagatcc 
tgccttcctg 
gctgggcccc 
catcctgggg 
ctgggagcag 
atacagegtc 



5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6252 



<210> 2 

<211> 1838 

<212> PRT 

<213> HOMO SAPIEN 



<400> 2 
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195 200 205 

Gly Tyr Leu Ser Thr Asn Thr Gin Gin Phe Val Ala Ala Phe Glu Asp. 

210 215 220 

Gly Pro Tyr Val Phe Phe Val Phe Asn Gin Gin Asp Lys His Pro Ala 
225 . 230 - 235 .240 

Arg Asn Arg Thr Leu Leu Ala Arg Met Cys Arg Glu Asp Pro Asn Tyr 

245 250 , 255 

Tyr Ser Tyr Leu Glu Met Asp Leu Gin Cys Arg Asp Pro Asp He His 

260- 265- 270 . - 

Ala Ala Ala Phe Gly Thr Cys Leu Ala Ala Ser Val Ala Ala Pro Gly 

275 ' 280 285 

Ser Gly Arg Val Leu Tyr Ala Val Phe Ser Arg Asp Ser Arg Ser Ser 

290 295 300 

Gly Gly Pro Gly Ala Gly Leu Cys Leu Phe Pro Leu Asp Lys Val His 
305 * 310 315 320 

Ala Lys Met Glu Ala Asn Arg Asn Ala Cys Tyr Thr Gly Thr Arg Glu 

325 330 335 

Ala Arg Asp He Phe Tyr Lys Pro Phe His Gly Asp He Gin Cys Gly 

340 ~ " 345 350 

Gly His Ala Pro Gly Ser Ser Lys Ser Phe Pro Cys Gly Ser Glu His 
355 360 365 

-Leu Pro-Tyr -Pro. Leu -Gly Ser -Arg AspJ32y._Leu .Ar^ „G^y _Ala Val_ 
370 375 380 

Leu Gin Arg Gly Gly Leu Asn Leu Thr Ala Val Thr Val Ala Ala Glu 
385 390 395 400 

Asn Asn His Thr Val Ala Phe Leu Gly Thr Ser Asp Gly Arg lie Leu 

405 410 415 

Lys Val Tyr Leu Thr Pro Asp Gly Thr Ser Ser Glu Tyr Asp Ser He 

420 * 425 430 

Leu Val Glu He Asn Lys Arg Val Lys Arg Asp Leu Val Leu Ser Gly 

435 440 445 

Asp Leu Gly Ser Leu Tyr Ala Met Thr Gin Asp Lys Val Phe Arg Leu 

450 455 460 

Pro Val Gin Glu Cys Leu Ser Tyr Pro Thr Cys Thr Gin Cys Arg Asp 
465 470 475 480 

Ser Gin Asp Pro Tyr Cys Gly Trp Cys Val Val Glu Gly Arg Cys Thr 

485 490 495 

Arg Lys Ala Glu Cys Pro Arg Ala Glu Glu Ala Ser His Trp Leu Trp 

500 505 510 

Ser Arg Ser Lys Ser Cys Val Ala Val Thr Ser Ala Gin Pro Gin Asn 

515 520 525 

Met Ser Arg Arg Ala Gin Gly Glu Val Gin Leu Thr Val Ser Pro Leu 

530 535 540 

Pro Ala Leu Ser Glu Glu Asp Glu Leu Leu Cys Leu Phe Gly Glu Ser 
545 550 555 560 

Pro Pro His Pro Ala Arg Val Glu Gly Glu Ala Val He Cys Asn Ser 

565 570 575 

Pro Ser Ser He Pro Val Thr Pro Pro Gly Gin Asp His Val Ala Val 

580 585 . 590 

Thr He Gin Leu Leu Leu Arg Arg Gly Asn He Phe Leu Thr Ser Tyr 

595 600 605 

Gin Tyr Pro Phe Tyr Asp Cys Arg Gin Ala Met Ser Leu Glu Glu Asn 

610 615 620 

Leu Pro Cys He Ser Cys Val Ser Asn Arg Trp Thr Cys Gin Trp Asp 
625 "* 630 635 640 
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Leu Arg Tyr His Glu Cys Arg Glu Ala Ser Pro Asn Pro Glu Asp Gly 

645 650 655 

lie Val Arg Ala His Met Glu Asp Ser Cys Pro Gin Phe Leu Gly Pro 

660 665 670 

Ser Pro Leu Val lie Pro Met Asn His Glu Thr Asp Val Asn Phe Gin 

675 680 685 

Gly Lys Asn Leu Asp Thr Val Lys Gly Ser Ser Leu His Val Gly Ser 

690 695 700 

Asp Leu Leu Lys Phe Met Glu Pro Val Thr Met Gin Glu Ser Gly Thr 
705 710 715 720 

Phe Ala Phe Arg Thr Pro Lys Leu Ser His Asp Ala Asn Glu Thr Leu 

725 730 735 

Pro Leu His Leu Tyr Val Lys Ser Tyr Gly Lys Asn lie Asp Ser Lys 

740 745 750 

Leu His Val Thr Leu Tyr Asn Cys Ser Phe Gly Arg Ser Asp Cys Ser 

755 ** 760 765 

Leu Cys Arg Ala Ala Asn Pro Asp Tyr Arg Cys Ala Trp Cys Gly Gly 

770 775 780 

Gin Ser Arg Cys Val Tyr Glu Ala Leu Cys Asn Thr Thr Ser Glu Cys 
785 790 795 800 

Pro Pro Pro Val He Thr Arg He Gin Pro Glu Thr Gly Pro Leu Gly 

_805. 81 0 _ 815 

Gly Giy lie Arg He Thr He Leu Gly Ser Asn Leu Gly Val Gin Ala 

820 825 830 

Gly Asp He Gin Arg He Ser Val Ala Gly Arg Asn Cys Ser Phe Gin 

835 840 845 

Pro Glu Arg Tyr Ser Val Ser Thr Arg He Val Cys Val He Glu Ala 

850 * 855 . 860 

Ala Glu Thr Pro Phe Thr Gly Gly Val Glu Val Asp Val Phe Gly Lys 
865 870 875 880 

Leu Gly Arg Ser Pro Pro Asn Val Gin Phe Thr Phe Gin Gin Pro Lys 

885 890 895 

Pro Leu Ser Val Glu Pro Gin Gin Gly Pro Gin Ala Gly Gly Thr Thr 

900 905 910 

Leu Thr He His Gly Thr His Leu Asp Thr Gly Ser Gin Glu Asp Val 

915 920 _ 925 

Arg Val Thr Leu Asn Gly Val Pro Cys Lys Val Thr Lys Phe Gly Ala 

930 935 940 

Gin Leu Gin Cys Val Thr Gly Pro Gin Ala Thr Arg Gly Gin Met Leu 
945 " 950 955 960 

Leu Glu Val Ser Tyr Gly Gly Ser Pro Val Pro Asn Pro Gly He Phe 

965 970 975 

Phe Thr Tyr Arg Glu Asn Pro Val Leu Arg Ala Phe Glu Pro Leu Arg 

980 985 990 

Ser Phe Ala Ser Gly Gly Arg Ser He Asn Val Thr Gly Gin Gly Phe 

995 1000 1005 

Ser Leu He Gin Arg Phe Ala Met Val Val He Ala Glu Pro Leu Gin 

1010 1015 1020 

Ser Trp Gin Pro Pro Arg Glu Ala Glu Ser Leu Gin Pro Met Thr Val 
1025 1030 1035 1040 

Val Gly Thr Asp Tyr Val Phe His Asn Asp Thr Lys Val Val Phe Leu 

1045 1050 1055 

Ser Pro Ala Val Pro Glu Glu Pro Glu Ala Tyr Asn Leu Thr Val Leu 

1060 1065 1070 

He Glu Met Asp Gly His Arg Ala Leu Leu Arg Thr Glu Ala Gly Ala 
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1075 1080 1085 

Phe Glu Tyr Val Pro Asp Pro Thr Phe Glu Asn Phe Thr Gly Gly Val 

1090 1095 1100 

Lys Lys Gin Val Asn Lys Leu He His Ala Arg Gly Thr Asn Leu Asn 
1105 1110 1115 1120 

Lys Ala Met Thr Leu Gin Glu Ala Glu Ala Phe Val Gly Ala Glu Arg 

1125 . 1130 1135 

Cys Thr Met Lys Thr Leu Thr Glu Thr Asp Leu Tyr Cys Glu Pro Pro 

1140 1145 1150 

Glu Val Gin Pro Pro Pro Lys Arg Arg Gin Lys Arg Asp Thr Thr His 

1155. 1160 1165 

Asn Leu Pro Glu Phe He Val Lys Phe Gly Ser Arg Glu Trp Val Leu 

1170 1175 1180 

Gly Arg Val Glu Tyr Asp Thr Arg Val Ser Asp Val Pro Leu Ser Leu 
1185 1190 1195 1200 

He Leu Pro Leu Val He Val Pro Met Val Val Val He Ala Val Ser 

1205 1210 1215 

Val Tyr Cys Tyr Trp Arg Lys Ser Gin Gin Ala Glu Arg Glu Tyr Glu 

1220 1225 1230 

Lys He Lys Ser Gin Leu Glu Gly Leu Glu Glu Ser Val Arg Asp Arg 

1235 1240 1245 

Cys Lys .Lys Glu Phe. _Thr Asp. Leu Met lie. _Glu Jlet._Glu .Asp_Gln Thr_ 

1250 1255 1260 

Asn Asp Val His Glu Ala Gly He Pro Val Leu Asp Tyr Lys Thr Tyr 
1265 1270 1275 1280 

Thr Asp Arg Val Phe Phe Leu Pro Ser Lys Asp Gly Asp Lys Asp Val 

1285 1290 1295 

Met He Thr Gly Lys Leu Asp He Pro Glu Pro Arg Arg Pro Val Val 

1300 1305 1310 

Glu Gin Ala Leu Tyr Gin Phe Ser Asn Leu Leu Asn Ser Lys Ser Phe 

1315 1320 1325 

Leu He Asn Phe He His Thr Leu Glu Asn Gin Arg Glu Phe Ser Ala 

1330 1335 1340 

Arg Ala Lys Val Tyr Phe Ala Ser Leu Leu Thr Val Ala Leu His Gly 
1345 1350 1355 1360 

Lys Leu Glu Tyr Tyr Thr Asp lie Met His -Thr Leu Phe Leu .Glu Leu 

1365 1370 1375 

Leu Glu Gin Tyr Val Val Ala Lys Asn Pro Lys Leu Met Leu Arg Arg 

1380 1385 1390 

Ser Glu Thr Val Val Glu Arg Met Leu Ser Asn Trp Met Ser He Cys 

1395 1400 1405 

Leu Tyr Gin Tyr Leu Lys Asp Ser Ala Gly Glu Pro Leu Tyr Lys Leu 

1410 1415 1420 

Phe Lys Ala He Lys His Gin Val Glu Lys Gly Pro Val Asp Ala Val 
1425 1430 1435 1440 

Gin Lys Lys Ala Lys Tyr Thr Leu Asn Asp Thr Gly Leu Leu Gly Asp 

1445 1450 1455 

Asp Val Glu Tyr Ala Pro Leu Thr Val Ser Val He Val Gin Asp Glu 

1460 1465 1470 

Gly Val Asp Ala He Pro Val Lys Val Leu Asn Cys Asp Thr He Ser 

1475 1480 1485 

Gin Val Lys Glu Lys He He Asp Gin Val Tyr Arg Gly Gin Pro Cys 

1490 1495 1500 

Ser Cys Trp Pro Arg Pro Asp Ser Val Val Leu Glu Trp Arg Pro Gly 
1505 1510 1515 1520 
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Ser Thr Ala Gin He Leu Ser Asp Leu Asp Leu Thr Ser Gin Arg Glu 

.1525 1530 1535 

Gly Arg Trp Lys Arg Val Asn Thr Leu Met His Tyr Asn Val Arg Asp 

1540 1545 1550 

Gly Ala Thr Leu He Leu Ser Lys Val Gly Val Ser Gin Gin Pro Glu 

1555 1560 1565 

Asp Ser Gin Gin Asp Leu Pro Gly Glu Arg His Ala Leu Leu Glu Glu 

1570 1575 1580 

Glu Asn Arg Val Trp His Leu Val Arg Pro Thr Asp Glu Val Asp Glu 
1585 1590 1595 1600 

Gly Lys Ser Lys Arg Gly Ser Val Lys Glu Lys Glu Arg Thr Lys Ala 

1605 1610 1615 

He Thr Glu lie Tyr Leu Thr Arg Leu Leu Ser Val Lys Gly Thr Leu 

1620 1625 1630 

Gin Gin Phe Val Asp Asn Phe Phe Gin Ser Val Leu Ala Pro Gly His 

1635 1640 1645 

Ala Val Pro Pro Ala Val Lys Tyr Phe Phe Asp Phe Leu Asp Glu Gin 

1650 1655 1660 

Ala Glu Lys His Asn He Gin Asp Glu Asp Thr He His He Trp Lys 
1665 1670 1675 1680 

Thr Asn Ser Leu Pro Leu Arg Phe Trp Val Asn He Leu Lys Asn Pro 

1685 1690 1695 

His Phe He Phe Asp Val His Val His Glu Val Val Asp Ala Ser Leu 

1700 1705 1710 

Ser Val He Ala Gin Thr Phe Met Asp Ala Cys Thr Arg Thr Glu His 

1715 1720 1725 

Lys Leu Ser Arg Aso Ser Pro Ser Asn Lys Leu Leu Tyr Ala Lys Glu 

1730 1735 1740 

He Ser Thr Tyr Lys Lys Met Val Glu Asp Tyr Tyr Lys Gly He Arg 
1745 1750 1755 1760 

Gin Met Val Gin Val Ser Asp Gin Asp Met Asn Thr His Leu Ala Glu 

1765 1770 1775 

He Ser Arg Ala His Thr Asp Ser Leu Asn Thr Leu Val Ala Leu His 

1780 1785 1790 

Gin Leu Tyr Gin Tyr Thr Gin Lys Tyr Tyr Asp Glu He lie Asn Ala 

1795 1800 1805 

Leu Glu Glu Asp Pro Ala Ala Gin Lys Met Gin Leu Ala Phe Arg Leu 

1810 1815 1820 

Gin Gin He Ala Ala Ala Leu Glu Asn Lys Val Thr Asp Leu 
1825 1830 1835 

<210> 3 

<211> 5367 

<212> DNA 

<213> HOMO SAPIEN 

<400> 3 

atggctcgct ggcctccctt cggcctctgc ctcctcctgc tgctgctgtc cccaccgcca 60 

ctgcccttga caggggccca tcgcttctcc gcacctaata ccactctcaa ccacttggca 120 

ctggcacctg gccgaggcac actctatgtc ggcgcagtga accgcctctt ccagctcagc 180 

cccgagctgc agctcgaggc cgtggctgtc actggccctg taatcgacag ccctgactgc 240 

gtgcccttcc gtgacccagc cgagtgccca caggcccagc tcactgacaa tgccaaccag 3 00 

ctgctgctgg tgagcagccg cgcccaggag ctggtggcct gcgggcaggt gcggcagggc 3 60 

gtgtgtgaga cacggcgcct tggggatgtg gccgaggtgc tgtaccaggc tgaggaccct 420 

ggtgacgggc agtttgtggc tgccaatacc ccgggagtgg caacggtggg gctggtggtg 480 
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cccttgcccg gccgggacct cctgcttgtg gccagaggcc tggcgggcaa gctgtcggca 540 
ggggtgccac ccctggccat ccgccagctg gccgggcctc agcccttctc cagcgagggc 600 
ctgggccgcc tggtggtggg cgacttcccc gactacaaca acagctacgt cggggccttt 660 
gccgacgccc gctccgccta cttcgtgttc cgccgccgcg gggcccgggc ccaggctgag 720 
taccgctcct acgtggcccg cgtctgcctg ggggacacca acctgtactc ctacgtggag 780 
gtccccctcg cccgccaggg ccagggcctc atccaggccg ccttccttgc cccgggcacc 840 
ttgctagggg tgtttgccgc gggcccaagg ggcacccagg cggcgctctg tgccttcccc 900 

atggtggagc tgggtgccag catggagcag gcccggagac tccgctacac ggcgggcggc * 960 

cggggcccca gcggcgcaga ggaagccacc gtggagtacg gcgtcacgtc gcgccgcgtc 1020 

accctgcccc ctgattcccc cgagtcgtac ccctgtggcg acgagcacac ccccagcccc i080 

attgctggcc gccagcccct ggaggtccag cctctgctga agctcgggca gccggtcagc 1140 

gccgtggcag ctctccaggc agatgggcac acgatagcct tcctggggga cacccagggc 1200 

cagctgtaca aggtctttct ccacggcccc cagggccagg tttaccactc ccagcaagtg 1260 

gggcctccag gctcagccat cagcccagac ctgctgctgg acagcagtgg cagtcacctc 1320 

tatgtcctga ctgcccacca ggtggaccgg atacctgtgg cagcctgccc ccagttccct 1380 

gactgtgcca gctgcctcca ggcccaggac ccgctgtgtg gctggtgtgt cctccagggc 1440 

aggtgtaccc ggaagggcca gtgcgggcgg gcaggccagc tgaaccagtg gctgtggagt 1500 

tatgaggagg acagccactg cctgcacatc cagagcctgc tgccgggcca ccacccccgc 1560 

caggagcagg gccaggtcac tttgtctgtc ccccggctgc ccatcctgga tgcagatgaa 1620 

tacttccatc gtgcgttcgg ggactatgac agcttggctc atgtggaagg gccccacgtg 1680 

gcctgtgtca cccctcccca agaccaggtg ccacttaacc ctccaggcac agaccacgtc 1740 

-accgcgcccc -C-ggcccr.ga-t-jg-r,t cgaggac _gtg.ac.tgjtg.g XLtg.cicacica_a _ctj:_c ; tec ttt ^ 1800 

tatgactgea gtgccgtcca ggccttggag gcggctgccc ccgtccttcc ccagggcctg 1860 

cctgcctcct tccactgctg gctggagctg cctggagaac • tteggggact gccggccacc 1920 

ctggaggaga cagcagggga ttcaggcctc atccactgcc aggcccacca gegggagetc 1980 

ccagtgccca tctacgtcac ccagggtgaa gcccagaggc tggacaacac ccatgctctt 2040 

tatggtgagc ctgagggcag ccaggcaggc ggggcagggt gggtggcaga caggaggege 2100 

tcagcacact gcctgacccc ccctagtgat cctgtacgac tgcgccatgg gccacccgga 2160 

ctgcagccac tgecaagegg ccaacaggag cctgggctgc ctgtgaccag ccctgcccca 2220 

ggcccccaaa ccccagcagc tcggcctggc tgggctggtt ggctggccgg gcacccagca 2280 

ctgcagagtg gagcgtgggt gegggggace ccatctgcca teatttgect getgeaggtc 2340 

gagcccctga ccggtccccc tgagggaggc ttggccctca ccatcctggg ctccaacctg 2400 

ggccgggcct tegecga'tgt gcagtacgcc gaccctgtcc tgetgagect gagtcctcgc 2460 

tggggccccc aggcaggggg cacccagctc accatccgag gtcagcacct ccagacaggt 2520 

ggcaacacca gtgccttcgt gggtggccaa ccctgtccca tgggtgggcg actgatccgt 2580 

gtcaggggca ccggcctaga cgtggtgcag cggcccctac tgtctgtgtg gctggaggct 2640 

gaegcagagg tgeaggctte cagggcccag ccccaggacc cacagccaag gaggagctgt 2700 

ggagcccctg ctgcggaccc ccaggcttgt atccagctcg gtggggggct gctgcagcgc 2760 

acagcagagc ccagctcact ccacctgtgg tcggccctga atgccccaca gtgctccacc 2820 

gtctgctccg tcaactcgtc cagcctcctc ctgtgccgga gccctgctgt accagacaga 2880 

gcccacccgc agegggtett cttcacccta gacaacgtgc aagtggactt cgccagtgcc 2940 

agtgggggcc agggcttcct gtaccagccc aacccccgcc tggcacccct cagccgcgag 3000 

gggcctgccc gcccctaccg cctcaagcca ggccatgtcc tggatgtgga gggegaggge 3060 

ctcaacctgg gcatcagcaa ggaggaggtg cgcgtgcaca tcggccgcgg cgagtgcctg 3120 

gtgaagaege tcacgcgcac ccacctgtac tgcgagccgc ctgcgcacgc cccgcagcct 3180 

gecaatgget ccggcctgcc acagttcgtg gtgcagatgg gcaatgtgca gctggccctg 3240 

ggccctgtgc agtacgaggc tgaacccccg ctgtctgcct ttcccgtgga ggcccaggca 3300 

ggcgtgggca tgggtgctgc agtgctgatt gccgccgtgc tcctcctcac cctcatgtac 3360 

aggcacaaga gcaagcaggc cctgcgggac taccagaagg tgctagtgca gctggagagc 3 420 

ctggagaccg gcgtgggaga ccagtgccgc aaggagttca cagacctcat gaeggagatg 3480 

accgacctca gcagcgacct ggagggcagc gggatcccct tcctggacta ccgcacctac 3540 

gccgagcgcg ccttcttccc tggccatggc ggttgcccgc tgcagcccaa gectgagggg 3600 

ccaggggagg acggccactg tgccactgtg cgccagggcc tcacgcagct ctccaacctg 3660 

ctcaacagca agctcttcct cctcacggtg agggccgtgt ggcgggagtg cccagtgggc 3720 

aaggaggtgg ggctggggaa ctactggcct gagacaaagg tgggggagga gacagagacc 3780 



• 8 - 



WO 01/14420 



PCT/US00/23365 



atggtggaga 
gtggctggtg 
cccgtggacg 
gaggacgtgg 
gccgcaggca 
acccaggtca 
ccctcagtgc 
cccttcccta 
acgctggagg 
gagccagaag 
aaggccattc 
gtggctgggc 
gcaggggcag 
cctgcctcgc 
gtgaaccggc 
gagaagcacg 
ctgcggttct 
gacaatgtgg 
tcggagcata 
aggaggaaag 
ggcagcatca 
gagcaggatt 
atggtggaga 
aactctgctt 
ctgcaagaac 
gaggaccctg 
gtggaaaaca 



aactgctcac 
aaccactgta 
ccgtgacagg 
agttccagcc 
gcagcgagat 
aggagaaggt 
atgccctaga 
cccctccggc 
atggcgagga 
gggccaaggt 
cggaaatcta 
ctgagaggag 
cctcagccgt 
agggcacgct 
ccatccccat 
gcatcgagga 
gggtgaatgc 
acgccatcct 
aagtgggccg 
gcttatgggg 
tgggggcacc 
ccccagtgaa 
ggtactatgc 
tggctgagct 
tctacaacca 
tgggccagaa 
aagtgacxga 



caactggctg 
catgctcttc 
caaggccaaa 
cctgacgctg 
gcagcgcgtg 
gttggaccaa 
ccttggtgag 
acctggagcc 

ggggggggtg 
"gcggtgcagc 
cctcacccgt 
gctcagccag 
ggatggcccc 
gcagaagttt 
cgccgtcaag 
cccagggacc 
cttgaagaac 
tgctgtcatc 
ggtgagagca 
gaagcctaga 
ttcacctccg 
-caaactgctc 
ggacattcgc 
ctccgggaac 
catccacagg 
gctgcagctg 
cctgtga 



tccatctgcc 
cgggccatcc 
cggaccctga 
atggtgctgg 
ccagcccggg 
gtctacaagg 
agagccagcc 
cctcaactgt 
tgcctctggc 
agcctgcggg 
ctgctgtcca 
ggaccccgac 
cacaccctgc 
gtggacgaca 
tacctgtttg 
ctgcacatct 
ccacagctca 
gcccagacct 
gtgccagcag 
gggctgtgca 
agctcatgcc 
-tacgcccggg 
cagagctctc 
tacacttctg 
tactatgatc 
gcctgccgcc 



tgtacgcct t 
agtaccaggt 
atgatagccg 
tggggcccgg 
tgctcgacac 
gcaccccctt 
ctgcccaccc 
gtcttactat 
acctggtgaa 
agcgcgagcc 
tgaaggttgg 
cgagccaggg 
cctccacaca 
ccttccaggc 
accttctgga 
ggaagaccaa 
tctttgatgt 
tcattgactc 
cagcagctgg 
cagagctctg 
tagcgcctcc 
agatcecacg 
cggcgagcta 
ctccccactg 
agattatcag 
tgcagcaggt 



cc tgagggag 

ggacaaaggc 

cttgctgcgg 

ggctggcggg 

ggacaccatc 

ctcccagagg 

accccaggga 

gaacataccc 

agccaccgag 

agcaagggcc* 

tgcggcctgg 

tgtgggaggg 

gcccttatcc 

cattctcagc 

tgagctagca 

cagtctgctg 

acgggtgtcg 

ctgtaccacc 

caggggcttg 

ggtgggcagt 

cctccctccg 

ct-acaagcag 

ccaggagatg 

tctggaggct 

tgccctggag 

cgccgccctg 



3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5-100 
5160 
5220 
5280 
5340 
5367 



<210> 4 

<211> 1788 

<212> PRT 

<213> HOMO SAPIEN 



<400> 4 






Met 


Ala 


Arg 


Trp 


1 








Ser 


Pro 


Pro 


Pro 








20 


Asn 


Thr 


Thr 


Leu 






35 




Tyr 


Val 


Gly 


Ala 




50 






Leu 


Glu 


Ala 


Val 


65 








Val 


Pro 


Phe 


Arg 


Asn 


Ala 


Asn 


Gin 








100 


Ala 


Cys 


Gly 


Gin 






115 




Asp 


Val 


Ala 


Glu 




130 






Phe 


Val 


Ala 


Ala 



Pro 


Pro 


Phe 


Gly 


5 








Leu 


Pro 


Leu 


Thr 


Asn 


His 


Leu 


Ala 








40 


Val 


Asn 


Arg 


Leu 






55 




Ala 


Val 


Thr 


Gly 




70 






Asp 


Pro 


Ala 


Glu 


85 








Leu 


Leu 


Leu 


Val 


Val 


Arg 


Gin 


Gly 








120 


Val 


Leu 


Tyr 


Gin 






135 




Asn 


Thr 


Pro 


Gly 



Leu 


Cys 


Leu 


Leu 




10 






Gly 


Ala 


His 


Arg 


25 








Leu 


Ala 


Pro 


Gly 


Phe 


Gin 


Leu 


Ser 








60 


Pro 


Val 


He 


Asp 






75 




Cys 


Pro 


Gin 


Ala 




90 






Ser 


Ser 


Arg 


Ala 


105 








Val 


Cys 


Glu 


Thr 


Ala 


Glu 


Asp 


Pro 








140 


Val 


Ala 


Thr 


Val 



Leu 


Leu 


Leu 


Leu 






15 




Phe 


Ser 


Ala 


Pro 




30 






Arg 


Gly 


Thr 


Leu 


45 








Pro 


Glu 


Leu 


Gin 


Ser 


Pro 


Asp 


Cys 








80 


Gin 


Leu 


Thr 


Asp 






95 




Gin 


Glu 


Leu 


Val 




110 






Arg 


Arg 


Leu 


Gly 


125 








Gly 


Asp 


Gly 


Gin 


Gly 


Leu 


Val 


Val 
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145 150 155 160 

Pro Leu Pro Gly Arg Asp Leu .Leu Leu Val Ala Arg Gly Leu Ala Gly 

165 170 175 

Lys Leu Ser Ala Gly Val Pro Pro Leu Ala lie Arg Gin Leu Ala Gly 

180 185 190 . 

Ser Gin Pro Phe Ser Ser Glu Gly Leu Gly Arg Leu Val Val Gly Asp 

195 200 205 

Phe Ser Aso Tyr Asn Asn Ser Tyr Val Gly Ala Phe Ala Asp Ala Arg 

210 ~ 215 220 

Ser Ala Tyr Phe Val Phe Arg Arg Arg Gly Ala Arg Ala Gin Ala Glu 
225 - 230 235 240 

Tyr Arg Ser Tyr Val Ala Arg Val Cys Leu Gly Asp Thr Asn Leu Tyr 

245 250 255 

Ser Tyr Val Glu Val Pro Leu Ala Cys Gin Gly Gin Gly Leu lie Gin 

260 265 270 

Ala Ala Phe Leu Ala Pro Gly Thr Leu Leu Gly Val Phe Ala Ala Gly 

275 280 285 

Pro Arg Gly Thr Gin Ala Ala Leu Cys Ala Phe Pro Met Val Glu Leu 

290 " 295 300 

Gly Ala Ser Met Glu Gin Ala Arg Arg Leu Cys Tyr Thr Ala Gly Gly 
305 310 315 320 

- Arg Gly -P-ro^Ser -Gly -Ala jGIu j31u _ Ala Thr JVal . JU.u Tyr _G.ly._Val Thr 

325 330 335 

Ser Arg Cys Val Thr Leu Pro Leu Asp Ser Pro Glu Ser Tyr Pro Cys 

340 345 350 

Gly Asp Glu His Thr Pro Ser Pro lie Ala Gly Arg Gin Pro Leu Glu 

355 360 365 

Val Gin Pro Leu Leu Lys Leu Gly Gin Pro Val Ser Ala Val Ala Ala 

370 375 380 

Leu- Gin Ala Asp Gly His Met lie Ala Phe Leu Gly Asp Thr Gin Gly 
385 390 395 400 

Gin Leu Tyr Lys Val Phe Leu His Gly Ser Gin Gly Gin Val Tyr His 

405 410 415 

Ser Gin Gin Val Gly Pro Pro Gly Ser Ala He Ser Pro Asp Leu Leu 

420 425 430 

Leu Asp Ser Ser Gly Ser His Leu Tyr Val Leu Thr Ala His Gin Val 

435 440 445 

Asp Arg He Pro Val Ala Ala Cys Pro Gin Phe Pro Asp Cys Ala Ser 

450 455 460 

Cys Leu Gin Ala Gin Asp Pro Leu Cys Gly Trp Cys Val Leu Gin Gly 
465 470 475 480 

Arg Cys Thr Arg Lys Gly Gin Cys Gly Arg Ala Gly Gin Leu Asn Gin 

485 490 495 

Trp Leu Trp Ser Tyr Glu Glu Asp Ser His Cys Leu His He Gin Ser 

500 505 510 

Leu Leu Pro Gly His His Pro Arg Gin Glu Gin Gly Gin Val Thr Leu 

515 520 525 

Ser Val Pro Arg Leu Pro He Leu Asp Ala Asp Glu Tyr Phe His Cys 

530 535 540 

Ala Phe Gly Asp Tyr Asp Ser Leu Ala His Val Glu Gly Pro His Val 
545 550 555 560 

Ala Cys Val Thr Pro Pro Glh Asp Gin Val Pro Leu Asn Pro Pro Gly 

565 570 575 

Thr Asp His Val Thr Val Pro Leu Ala Leu Met Phe Glu Asp Val Thr 
580 585 590 
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val 


Ala 


Al a 


Th t* 

L 1 11. 


Asn. 


Phe 


Ser 




xy i 


Asp 
















600 






Leu 




ax a 


Ala 

Al CL 


AT a 


Prn 


V OX 


Leu 


XT J- U 


Gin 




Q1U 










615 








HIS 


Cys 


Trp 


Leu 


Pi i i 
Lai U. 


- 

Leu 


.T X LJ 


PI \/ 


Pi 11 


Leu 






















Leu 




bill 


rpVi -r- 

i nr 


Ala 


Lsiy 


ASp 


oer 


uiy 


J— it. Li 










Oft j 










650 


Gin 


Arg 


GlU 


Leu 


r _ O 


\/ j a 1 
Vol 


Pro 


T 1 o 

lie 


Tyr 


Va 1 
v a j. 








rr c n 
ObU 










C £ 
ODD 




Arg 


Leu 


Asp 


Asn 


Tnr 


nlS 


Axa 


Leu 


Tyr 


Pi vr 

biy 




O / D 










DO U 






Ala 


Gly 


Gly 


Ala 


Giy 


Trp 


vax 


Ala 


Asp 


Arg 














fi Q S 

O _7 J 








Leu 


Thr 


Leu 


Pro 


Ser 


Asp 


Pro 


Val 


Arg 


T .01 1 
Lie Li 


705 










/ X u 










Leu 


Qji xn 


Pro 


Leu 


Pro 


Ser 




Lt.LJ.1 


Gin 


Glu 










725 










730 


Ser 


Pro 


ax a 


Pro 


Pi \r 


pro 


VjJ.Il 


1 111 


JET X w 


Ala 








/ ft u 










745 




Gly 


Trp 


Leu 


Ala 


Laiy 


illS 


Pro 


Ala 
nla 


Leu 


PI n 

La ±.11 




7cc 










760 






Gly 


Tnr 


Pro 


Ser 


a 1 a 
Ala 


Tin 

x xe 


Tla 
lie 




Leu 


Leu 


"7*7 O 










775 










Pro 


Pro 


uXU 


Pi vr 


Pi \7 


Leu 


Ala 


Leu 


Thr 


785 










7 90 










Gly 


Arg 


Axa 


O Vi a 


nla 


Asp 


Val 


uin 




Ala 






one; 










810 


Leu 


Ser 


Pro 


Arg 


irp 


v^xy 


pro 


uin 


Al 3 
nla 


PI v 








o z U 










O Z D 




Arg 


Gly 


Gin 


His 


Leu 


Gin 


Tnr 


(jiy 


Giy 


Asn 




O J D 










0 ft L/ 






Gly 


Gin 


Pro 


Cys 


Pro 


Met 


Gly 


Gly 


Arg 


Leu 


OCA 










pec 








Gly 


Leu 


Asp 


val 


Val 


Gin 


Arg 


Pro 


Leu 


Leu 


865 










o / u 










Asp 


Ala 


GlU 


vai 


Lrin. 


ai a 
AX a 


Ser 


Arg 


Ala 


Gin 








OO j 










890 

O _/ w 


Arg 


Arg 


Ser 


Cys 


pi -vr 
Gxy 


Ala 
Ala 


Pro 


Ala 


Al a 
nlo 


Asp 






y uu 










y u j 




Leu 


Gly 


Gly 


Gly 


Leu 


Leu 


Gxn 


Arg 


mr 


Ala 
Ala 






QIC 










q 9 n 






Leu 


Trp 


Ser 


Ala 


Leu 


Asn 


Ala 

ax a 


Pro 


Pi n 


L* Jf S3 




y j u 


















Asn 


Ser 


Ser 


Ser 


Leu 


Leu 


Leu 


Cys 


Arg 




945 




















Ala 


His 


Pro 


Gin 


Arg 


v ax 


Phe 


Phe 


Thr 


Leu 










965 










970 


Phe 


Ala 


Ser 


Ala 


Ser 


Gly 


Gly 


Gin 


Gly 


Phe 








980 










985 




Arg 


Leu 


Ala 


Pro 


Leu 


Ser 


Arg 


Glu 


Gly 


Pro 




995 










1000 




Lys 


Pro 


Gly 


His 


Val 


Leu 


Asp 


Val 


Glu 


Gly 


1010 








1015 






lie 


Ser 


Lys 


Glu 


Glu 


Val 


Arg 


Val 


His 


He 
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Cvs 


Ser 


Ala 


Val 


Gin 


Ala 






605 








Gly 


Leu 


Pro 


Ala 


Ser 


Phe 




620 










A rrr 

ax y 


Gly Leu 


Pro 


Ala 


Thr 


635 










640 


Tip 
lit? 


His 


Cys 


Gin 


Ala 


His 










655 




X A 1 J- 


Gin 


Gly 


Glu 


Ala 


Gin 








670 






L7 1 Li 


Pro 


Glu 


pi \/ 


Ser 


Gin 






685 








Arg 


Arg 


Ser 


Ala 


His 


Pvs 
L,y a 




700 










_ 

Arg 


His 


Gly 


Pro 


Pro 


Glv 


/ 1 










72 0 


JCr X L/ 


Gly 


Leu 


Pro 


Val 


Thr 










735 




Ala 


Arg 


Pro 


Glv 


TlTD 


Ala 








750 








Gly Ala 


a 1 y 


Val 


At"ct 






765 








Pi n 


Val 


"Glu 


Pro 


Leu 


Thr 




780 










He 


Leu 


Gly 


Ser 


Asn 


Leu 


795 










800 


Asp 


Pro 


Val 


Leu 


Leu 


Ser 








815 




Pi \r 

\ji y 


Thr 


Gin 


T .^i i 

XJGLL 


Thr 


lie 












rpl_ 


Ser 


Ala 


Jrne 


\/a1 

vai 


LrXy 






845 








Ti- 
ne 


Arg 


Val 


Arg 


Pi \r 

Lsiy 


1 XIX 




860 










Ser 


Val 


Trp 


Leu 


Pi n 


Al a 


0/3 










880 


Pro 


Gin Asp 


Prn 


Gin 


Pro 










895 




pro 


Gin 


Ala 






Gin 








910 






PI u 


Pro 


Ser 


Ser 


Leu 


His 






925 










Thr 


Val 


L~yti 


Ser 


Val 




940 










ir 1. \J 


Ala 


Val 


Pro 


Asp 


Arg 


y -j *j 










960 


Asp 


Asn 


Val 


Gin 


Va 1 


Asp 










975 




Leu 


Tyr 


Gin 


Pro 


Asn 


Pro 








990 






Ala 


Arg 


Pro 


Tyr 


Arg 


Leu 






1005 






Glu 


Gly Leu Asn 


Leu 


Gly 




1020 








Gly 


Arg Gly 


Glu 


Cys 


Leu 
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1025 1030 1035 1040 

Val Lys thr Leu Thr Arg Thr His Leu Tyr Cys Glu Pro Pro Ala His 

1045 1050 1055 

Ala Pro Gin Pro Ala Asn Gly Ser Gly Leu Pro Gin Phe. Val Val Gin 

1060 1065 1070 

Met Gly Asn Val Gin Leu Ala Leu Gly Pro Val Gin Tyr Glu Ala Glu 

1075 1080 1085 

Pro Pro Leu Ser Ala Phe Pro Val Glu Ala Gin Ala Gly Val Gly Met 

1090 1095 1100 

Gly Ala Ala Val Leu He Ala Ala Val Leu Leu Leu Thr Leu Met Tyr 
1105 1110 1115 1120 

Arg His Lys Ser Lys Gin Ala Leu Arg Asp Tyr Gin Lys Val Leu Val 

1125 1130 1135 

Gin Leu Glu Ser Leu Glu Thr Gly Val Gly Asp Gin Cys Arg Lys Glu 

1140 1145 1150 

Phe Thr Asp Leu Met Thr Glu Met Thr Asp Leu Ser Ser Asp Leu Glu 

1155 1160 1165 

Gly Ser Gly He Pro Phe Leu Asp Tyr Arg Thr Tyr Ala Glu Arg Ala 

1170 1175 1180 

Phe Phe Pro Gly His Gly Gly Cys Pro Leu Gin Pro Lys Pro Glu Gly 
1185 1190 1195 1200 

.Pro Gly Glu Asp .Gly _His ..Cys _Ala JThr Val ,Arg G_ln _Gly Leu Thr Gin 

1205 1210 1215 

Leu Ser Asn Leu Leu Asn Ser Lys Leu Phe Leu Leu Thr Val Arg Ala 

1220 1225 1230 

Val Trp Arg Glu Cys Pro Val Gly Lys Glu Val Gly Leu Gly Asn Tyr 

1235 1240 1245 

Trp Pro Glu Thr Lys Val Gly Glu Glu Thr Glu Thr Met Val Glu Lys 

1250 1255 1260 

Leu Leu Thr Asn Trp Leu Ser He Cys Leu Tyr Ala Phe Leu Arg Glu 
1265 1270 1275 1280 

Val Ala Gly Glu Pro Leu Tyr Met Leu Phe Arg Ala He Gin Tyr Gin 

1285 1290 1295 

Val Asp Lys Gly Pro Val Asp Ala Val Thr Gly Lys Ala Lys Arg Thr 

1300 1305 1310 

Leu Asn Asp Ser Arg Leu Leu Arg Glu Asp Val Glu Phe Gin Pro Leu 

1315 1320 1325 

Thr Leu Met Val Leu Val Gly Pro Gly Ala Gly Gly Ala Ala "Gly Ser 

1330 1335 1340 

Ser Glu Met Gin Arg Val Pro- Ala Arg Val Leu Asp Thr Asp Thr He 
1345 1350 1355 1360 

Thr Gin Val Lys Glu Lys Val Leu Asp Gin Val Tyr Lys Gly Thr Pro 

1365 1370 .1375 

Phe Ser Gin Arg Pro Ser Val His Ala Leu Asp Leu Gly Glu Arg. Ala 

1380 " 1385 1390 

Ser Pro Ala His Pro Pro Gin Gly Pro Phe Pro Thr Pro Pro Ala Pro 

1395 1400 1405 

Gly Ala Pro Gin Leu Cys Leu Thr Met Asn He Pro Thr Leu Glu Asp 

1410 1415 1420 

Gly Glu Glu Gly Gly Val Cys Leu Trp His Leu Val Lys Ala Thr Glu 
1425 1430 - 1435 1440 

Glu Pro Glu Gly Ala Lys Val Arg Cys Ser Ser Leu Arg Glu Arg Glu 

1445 1450 1455 

Pro Ala Arg Ala Lys Ala He Pro Glu He Tyr Leu Thr Arg Leu Leu 
1460 1465 1470 
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Ser Met Lys Val Gly Ala Ala Trp Val Ala Gly Pro Glu Arg Arg Leu 

1475 1480 1485 

Ser Gin Gly Pro Arg Pro Ser Gin Gly Val Gly Gly Ala Gly Ala Ala 

1490 1495 1500 

Ser Ala Val Asp Gly Pro His Thr Leu Pro Ser Thr Gin Pro Leu Ser 
1505 1510 1515 1520 

Pro Ala Ser Gin Gly Thr Leu Gin Lys Phe Val Asp Asp Thr Phe Gin 

1525 1530 1535 

Ala He Leu Ser Val Asn Arg Pro He Pro He Ala Val Lys Tyr Leu 

1540 1545 1550 

Phe Asp Leu Leu Asp Glu Leu Ala Glu Lys His Gly He Glu Asp Pro 

1555 1560 1565 

Gly Thr Leu His He Trp Lys Thr Asn Ser Leu Leu Leu Arg Phe Trp 

1570 1575 1580 

Val Asn Ala Leu Lys Asn Pro Gin Leu He Phe Asp Val Arg Val Ser 
1585 1590 1595 1600 

Asp Asn Val Asp Ala He Leu Ala Val He Ala Gin Thr Phe lie Asp 

1605 1610 1615 

Se^ cys Thr Thr Ser Glu His Lys Val Gly Arg Val Arg Ala Val Pro 

1620 1625 1630 

Ala Ala Ala Ala Gly Arg Gly Leu Arg Arg Lys Gly Leu Trp Gly Lys 

1635 1640 1645 

Pro Arg Gly Leu "Cys Thr Glu 'Leu Trp Val Gly "Ser Gly Ser He Met 

1650 1655 1660 

Gly Ala Pro Ser Pro Pro Ser Ser Cys Leu Ala Pro Pro Leu Pro Pro 
1665 1670 1675 1680 

Glu Gin Asp Ser Pro Val Asn Lys Leu Leu Tyr Ala Arg Glu lie Pro 

1685 1690 1695 

Arg Tyr Lys Gin Met Val Glu Arg Tyr Tyr Ala Asp He Arg Gin Ser 

1700 1705 1710 

Ser Pro Ala Ser Tyr Gin Glu Met Asn Ser Ala Leu Ala Glu Leu Ser 

1715 ^ 1720 1725 

Gly Asn Tyr Thr Ser Ala Pro His Cys Leu Glu Ala Leu Gin Glu Leu 

1730 1735 1740 

Tyr Asn His He His Arg Tyr Tyr Asp Gin He He Ser Ala Leu Glu 
1745 1750 1755 17 ^° 

Glu Asp Pro Val Gly Gin Lys Leu Gin Leu Ala Cys Arg Leu Gin Gin 

1765 1770 1775 

Val Ala Ala Leu Val Glu Asn Lys Val Thr Asp Leu 
1780 1785 

<210> 5 

<211> 5892 

<212> DNA 

<213> HOMO SAPIEN 

<400> 5 crx 
gcgcacgccc ggatggctct tcgcgccgcg ggcggcgcac cctttagcgg cccggccgcc 60 
gctgccagcc ccccgccgtt ccagacgccg ccgcggtgcc cggtgccgct gctgttgctg 12 0 
ctgctcctgg gggcggcgcg ggccggcgcc ctggagatcc agcgtcggtt cccctcgccc 180 
acgcccacca acaacttcgc cctggacggc gcggcgggga ccgtgtacct ggcggccgtc 240 
aaccgcctct atcagctgtc gggcgccaac ctgagcctgg aggccgaggc ggccgtgggc 300 
ccggtgcccg acagcccgct gtgtcacgct ccgcagctgc cgcaggcctc gtgcgagcac 360 
ccgcggcgcc tcacggacaa ctacaacaag atcctgcagc tggaccccgg ccagggcctg 420 
gtagtcgtgt gcgggtccat ctaccagggc ttctgccagc tgcggcgccg gggtaacatc 480 
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tcggccgtgg ccgtgcgctt cccgcccgcc gcgccgcccg ccgagcccgt cacggtgttc 540 

cccagcatgc tgaacgtggc ggccaaccac ccgaacgcgt ccaccgtggg gctagttctg 600 

ccccccgccg cgggcgcggg gggcagccgc ctgctcgtgg gcgccacgta caccggttac 660 

ggcagctcct tcttcccgcg caaccgcagc ctggaggacc accgcttcga gaacacgccc 720 

gagatcgcca tccgctccct ggacacgcgc ggcgacctgg ccaagctctt caccttcgac 780 

ctcaacccct ccgacgacaa catcctcaag atcaagcagg gcgccaagga gcagcacaag 840 

ccgggcttcg tgagcgccct cctgcacccg tccgacccgc cgccgggtgc acagtcctac 900 

gcgcacctgg cgctcaacag cgaggcgcgc gcgggcgaca aggagagcca ggcgcggagc 960 

ccgctggcgc gcatctgcct gccccacggc gccggcggcg acgccaagaa ^gctcaccgag 1020 

tcctacatcc agttgggctt gcagtgcgcg ggcggcgcgg gccgcggcga cctctacagc 1080 

cgcctggtgt cggtcttccc agcccgggag cggctctttg ctgtcc'tcga gcggccccag 1140 

gggtcccccg cggcccgcgc tgctccggcc gcactctgcg ccttccgctt cgccgacgcg 1200 

cgagccgcca tccgagctgc gcgcaccgcc tgcttcgtgg aaccggcgcc cgacguggtg 12 60 

gcggtgcrcg acagcgtggt gcagggcacg ggaccggcct gcgagcgcaa gctcaacatc 132 0 

cagctccagc cagagcagcc ggactgtgga gctgctcacc tgcagcaccc gctgtccatc 1380 

ctccagcccc tgaaggccac gcccgtgttc cgcgccccgg gcctcacctc cgtggccgtg 1440 

gccagcgtca acaactacac agcggtcttc ctgggcacgg tcaacgggag gcttctcaag 1500 

accaacctga acgagagcat gcaggtggtg agcaggcggg tggtgactgt ggcctatggg 1560 

gagcccgtgc accatgtcat gcagtttgac ccagcagact ccggttacct ttacctgatg 1620 

acgncccacc agatggccag ggtgaaggtc gccgcctgca acgtgcactc cacctgtggg 1680 

gaccgcgtgg gtgcggcgga cgcctactgc ggctggtgtg ccctggagac gcggtgcacc 1740 

.. 1 1 g cagcagg actgc accaa ttccagccag _qagca t ttc; t ggaccagtqc c agcgagggc 180 0 

cccagccgct gtcctgccat gaccgtcctg ccttccgaga tcgatgtgcg ccaggagtac "i860 

ccaggcatga tcctgcagat ctcgggcagc ctgcccagcc tcagtggcat ggagatggcc 1920 

tgngactatg ggaacaacat ccgcactgtg gctcgggtcc caggccctgc ctttggtcac 1980 

cacattgccc actgcaacct cctgccgagg gaccagtttc cgcccttccc ccccaaccag 2 040 

gaccacgtga ctgttgagat gtctgtgagg gtcaatgggc ggaacatcgt caaggccaat . 2100 

ttcaccatcc acgactgcag ccgcactgca caagtgtacc cccacacagc ctgtaccagc 2160 

tgcctgtcgg cacagtggcc ctgtttctgg tgcagccagc agcactcctg tgtttccaac 2220 

caccctcggc gcgaggcctc accaaacccc acgagccctc aggactgccc ccggaccctg 2280 

cccccacccc tggcacccgt gcctaccggt ggctcccaga acatcctggt gcctctggcc 2340 

aacactgccc ttttccaggg tgcagccctg gagtgtagtt ttgggctgga ggagatcttc 2400 

gaggctgtgt gggtgaatga gtctgttgta cgctgtgacc aggtggtgct gcacacgacc 2460 

cggaagagcc aggtgttccc gctcagcctc caactaaagg ggcggccagc ccgattcctg 2520 

gacagccctg agcccatgac. agtcatggtc tataactgtg ccatgggcag ccccgactgt 2580 

tcccagtgcc tgggccgcga agacctgggt cacctgtgcg tgtggagtga tggctgccgc 2640 

ctgcgggggc ctctgcagcc catggctggc acctgccccg cccccgagat ccgcgcgatt 2700 

gagcccctga gtggcccgtt ggacggtggg accctgctga ccatccgagg aaggaacctg 2760 

ggccggcggc tcagtgacgt ggcccacggc gtgtggattg gtggtgtggc ctgtgagcca 2820 

ctgcctgaca gatacacggt gtcggaggag atcgtgtgtg tcacagggcc agccccagga 2880 

ccgctctcag gtgtggtgac cgtgaacgcc tctaaggagg gcaagtcccg ggaccgcttc 2940 

tcccacgtgc tgcccctggt ccactccctg gagcctacca tgggccccaa ggccgggggc 3000 

accaggatca ccatccatgg gaatgacctc catgtaggct ccgagctcca ggtcctggtg 3060 

aacgacacag acccctgcac ggagctgatg cgcacagata ccagcatcgc ctgcaccatg 312 0 

cczgaggggg ccctgccggc tccggtgcct gtgtgtgtgc gcttcgagcg tcggggctgc 3180 

gtccacggca acctcacctt ctggtacatg cagaacccgg tcatcacggc catcagtccc 3240 

cgccgcagcc ctgtcagtgg cggcaggacc atcacagtgg ctggtgagcg tttccacatg 33 0 0 

gcgcagaatg tgtccatggc cgtccaccac attggccggg agcccacgct ctgcaaggtt 33 6 0 

ctcaactcca ccctcatcac ctgcccgtcc cccggggccc tgagcaacgc atcagcgcca 3420 

gtggacttct tcatcaatgg gcgggcctac gcagacgagg tggctgtggc tgaggagcta 348 0 

ctggaccccg aggaggcaca gcggggcagc aggttccgcc tggactacct ccccaaccca 3540 

cagTitctcta cggccaagag ggagaagtgg atcaagcacc accccgggga gcctctcacc 3 600 

ctcgttatcc acaaggagca ggacagcctg gggctccaga gtcacgagta ccgggtcaag 3660 

ataggccaag taagctgcga catccagatt gtctctgaca gaatcatcca ctgctcggnc 3720 

aacgagtccc tgggcgcggc cgtggggcag ctgcccatca caatccaggt agggaacttc 3780 
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aaccagacca tcgccacact gcagctgggg ggcagcgaga cggccatcat cgtgtccatc 3840 

gtcatctgca gcgtcctgct gctgctctcc gtggtggccc tgttcgtctt ctgtaccaag 3900 

agccgacgtg ctgagcgtta ctggcagaag acgctgctgc agatggagga gatggaatct 3960 

cagatccgag aggaaatccg caaaggcttc gctgagctgc agacagacat gacagatctg 4020 

accaaggagc tgaaccgcag ccagggcatc cccttcctgg agtataagca cttcgtgacc 4080 

cgcaccttct tccccaagtg ttcctccctt tatgaagagc gttacgtgct gccctcccag 4140 

accctcaact cccagggcag ctcccaggca caggaaaccc acccactgct gggagagtgg 4200 

aagattcctg agagctgccg gcccaacatg gaagagggaa ttagcgtgtt ctcctcacta 4260 

ctcaacaaca agcacttcct catcgtcttt gcccacgcgc tggagcagca gaaggacttt 4320 

gcggtgcgcg acaggtgcag cctggcctcg ctgctgacca tcgcgctgca cggcaagctg 4380 

gagtactaca ccagcatcat gaaggagctg ctggtggacc tcattgacgc ctcggccgcc 4440 

aagaacccca agctcatgct gcggcgcaca gagtctgtgg tggagaagat gctcaccaac 4500 

tggatgtcca tctgcatgta cagctgtctg cgggagacgg tgggggagcc attcttcctg 4560 

ctgctgtgtg ccatcaagca gcaaatcaac aagggctcca tcgacgccat cacaggcaag 4620 

gcccgctaca cactcaatga ggagtggctg ctgcgggaga acatcgaggc caagccccgg 4680 

aacctgaacg tgtccttcca gggctgtggc atggactcgc tgagcgtgcg ggccatggac 4740 

accgacacgc tgacacaggt caaggagaag atcctggagg ccttctgcaa gaatgtgccc 4800 

tactcccagt ggccgcgtgc agaggacgtc gaccttgagt ggttcgcctc cagcacacag 4860 

agctacatcc ttcgggacct ggacgacacc tcagtggtgg aagacggccg caagaagctt 4920 

aacacgctgg cccattacaa gatccctgaa ggtgcctccc tggccatgag tctcatagac 4980 

aagaaggaca acacactggg ccgagtgaaa gacttggaca cagagaagta tttccatttg 5040 

-g-tgctgccta cggacgagc-t ..ggcggagccc .aagaagtctc ^accggcagag -ccatcgcaag -5100 

aaggtgctcc cggaaatcta cctgacccgc ctgctctcca ccaagggcac gttgcagaag 5160 

tttctggatg acctgttcaa ggccattctg agtatccgtg aagacaagcc cccactggct 5220 

gtcaagtact ttttcgactt cctggaggag caggctgaga agaggggaat ctccgacccc 5280 

gacaccctac acatctggaa gaccaacagc cttcctctcc ggttctgggt gaacatcctg 5340 

aagaaccccc agtttgtctt tgacatcgac aagacagacc acatcgacgc ctgcctttca 5400 

gtcatcgcgc aggccttcat cgacgcctgc tccatctctg acctgcagct gggcaaggat 5460 

tcgccaacca acaagctcct ctacgccaag gagattcctg agtaccggaa gatcgtgcag 5520 

cgctactaca agcagatcca ggacatgacg ccgctcagcg agcaagagat gaatgcccat 5580 

ctggccgagg agtcgaggaa ataccagaat gagttcaaca ccaatgtggc catggcagag 5640 

atttttaggt cgcccaagag gtatcggccg cagatcatgg ccgcgctgga ggccaacccc 5700 

acggcccgga ggacacaact gcagcacaag tttgagcagg tggtggcttt gatggaggac 5760 

aacatctacg agtgctacag tgaggcctga gacacatgga gagttggtca ggctgctgct 5820 

gggagaaatg gacgcccact gggcctcaac ttgatcttct accccgtgcc tgtgactcag 5880 

actgggaaat ac 5892 



<210> 6 

<211> 1925 

<212> PRT 

<213> HOMO SAPIEN 

<400> 6 

Met Ala Leu Arg Ala Ala Gly Gly Ala Pro Phe Ser Gly Pro Ala Ala 

1 5 10 15 

Ala Ala Ser Pro Pro Pro Phe Gin Thr Pro Pro Arg Cys Pro Val Pro 

20 25 30 

Leu Leu Leu Leu Leu Leu Leu Gly Ala Ala Arg Ala Gly Ala Leu Glu 

35 40 45 

lie Gin Arg Arg Phe Pro Ser Pro Thr Pro Thr Asn Asn Phe Ala Leu 

50 ^ 55 60 

Asp Gly Ala Ala Gly Thr Val Tyr Leu Ala Ala Val Asn Arg Leu Tyr 
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Gin Leu Ser Gly Ala Asn Leu Ser Leu Glu Ala Glu Ala Ala Val Gly 

85 90 95 

Pro Val Pro Asp Ser Pro Leu Cys His Ala Pro Gin Leu Pro Gin Ala 

100 105 110 

Ser Cys Glu His Pro Arg Arg Leu Thr Asp Asn Tyr Asn Lys ile Leu 

115 120 125 

Gin Leu Asp -Pro Gly Gin Gly Leu Val Val Val Cys Gly Ser lie Tyr 

130 135 140 

Gin Gly Phe Cys Gin Leu Arg Arg Arg Gly Asn Ile Ser Ala Val Ala 
145 150 155 160- 

Val Arg Phe Pro Pro Ala Ala Pro Pro Ala Glu Pro Val Thr Val Phe 

165 170 175 

Pro Ser Met Leu Asn Val Ala Ala Asn His Pro Asn Ala Ser Thr Val 

180 185 190 

Gly Leu Val Leu Pro Pro Ala Ala Gly Ala Gly Gly Ser Arg Leu Leu 

195 200 205 

Val Gly Ala Thr Tyr Thr Gly Tyr Gly Ser Ser Phe Phe Pro Arg Asn 

210 215 220 

Arg Ser Leu Glu Asp His Arg Phe Glu Asn Thr Pro Glu Ile Ala Ile 
225 230 235 240 

Arg Ser Leu Asp Thr Arg Gly Asp Leu Ala Lys Leu Phe Thr Phe Asp 

. . 2.45 _ . . I. .25.0 . . _ 255 

Leu Asn Pro Ser Asp Asp Asn Ile Leu Lys Ile Lys Gin Gly Ala Lys 

260 265 270 

Glu Gin His Lys Leu Gly Phe Val Ser Ala Phe Leu His Pro Ser Asp 

275 280 285 

Pro Pro Pro Gly Ala Gin Ser Tyr Ala Tyr Leu Ala Leu Asn Ser Glu 

290 295 300 

Ala Arg Ala Gly Asp Lys Glu Ser Gin Ala Arg Ser Leu Leu Ala Arg 
305 310 315 320 

Ile Cys Leu Pro His Gly Ala Gly Gly Asp Ala Lys Lys Leu Thr Glu 

325 330 335 

Ser Tyr Ile Gin Leu Gly Leu Gin Cys Ala Gly Gly Ala Gly Arg Gly 

340 345 350 

Asp Leu Tyr Ser Arg Leu Val Ser Val Phe Pro Ala Arg Glu Arg Leu 

355 -360 365 

Phe Ala Val Phe Glu Arg Pro Gin Gly Ser Pro Ala Ala Arg Ala Ala 

370 375 380 

Pro Ala Ala Leu Cys Ala Phe Arg Phe Ala Asp Val Arg Ala Ala Ile 
385 390 395 400 

Arg Ala Ala Arg Thr Ala Cys Phe Val Glu Pro Ala Pro Asp Val Val 

405 410 415 

Ala Val Leu Asp Ser Val Val Gin Gly Thr Gly Pro Ala Cys Glu Arg 

420 425 430 

Lys Leu Asn Ile Gin Leu Gin Pro Glu Gin Leu Asp Cys Gly Ala Ala 

435 440 445 

His Leu Gin His Pro Leu Ser Ile Leu Gin Pro Leu Lys Ala Thr Pro 

450 455 460 

Val Phe Arg Ala Pro Gly Leu Thr Ser Val Ala Val Ala Ser Val Asn 
465 . 470 475 480 

Asn Tyr Thr Ala Val Phe Leu Gly Thr Val Asn Gly Arg Leu Leu Lys 

485 490 495 

Ile Asn Leu Asn Glu Ser Met Gin Val Val Ser Arg Arg Val Val Thr 

500 505 510 

Val Ala Tyr Gly Glu Pro Val His His Val Met Gin Phe Asp Pro Ala 
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515 




Asp 


Ser 


Gly 


Tyr 




530 






Lys 


Val 


Ala 


Ala 


545 








Ala 


Ala 


Asp 


Ala 


Leu 


Gin 


Gin 


Asp 








con 


Ala 


Ser 


Glu 


Gly 






595 




Glu 


He 


Asp 


Val 




610 






Gly 


Ser 


Leu 


Pro 


625 








Asn 


Asn 


He 


Arg 


Gin 


He 


Ala 


Tyr 








O O U 


Pro 


Pro 


Asn 


Gin 






675 




-Gly 


-Arg 


Asn 


He 




690 






Thr 


Ala 


Gin 


Val 


705 








Gin 


Trp 


Pro 


Cys 


Gin 


Ser 


Arg 


Cys 








n a c\ 
/ 4U 


Pro 


Arg 


Thr 


Leu 






755 




Gin 


Asn 


He 


Leu 




770 






Ala 


Leu 


Glu 


Cys 


785 








Val 


Asn 


Glu 


Ser 


Arg 


Lys 


Ser 


Gin 








□ O fi 


Ala 


Arg 


Phe 


Leu 






835 




Cys 


Ala 


Met 


Gly 




850 






Leu 


Gly 


His 


Leu 


865 








Leu 


Gin 


Pro 


Met 


Glu 


Pro 


Leu 


Ser 








900 


Gly 


Arg 


Asn 


Leu 






915 




lie 


Gly 


Gly 


Val 




930 






Glu 


Glu 


He 


Val 



945 



520 

Leu Tyr Leu Met 
535 

Cys Asn Val His 
550 

Tyr Cys Gly Trp 
565 

Cys Thr Asn Ser 

Pro Ser Arg Cys 
600 

Arg Gin Glu Tyr 
615 

Ser Leu Ser Gly 
630 

Thr Val Ala Arg 
645 

Cys Asn Leu Leu 

Asp His Val Thr 
680 

Val Lys Ala - Asn 
695 

Tyr Pro His Thr 
710 

Phe Trp Cys Ser 
725 

Glu Ala Ser Pro 

Leu Ser Pro Leu 
760 

Val Pro Leu Ala 
775 

Ser Phe Gly Leu 
790 

Val Val Arg Cys 
805 

Val Phe Pro Leu 

Asp Ser Pro Glu 
840 

Ser Pro Asp Cys 
855 

Cys Val Trp Ser 
870 

Ala Gly Thr Cys 
885 

Gly Pro Leu Asp 

Gly Arg Arg Leu 
920 

Ala Cys Glu Pro 
935 

Cys Val Thr Gly 
950 



Thr Ser His Gin 
540 

Ser Thr Cys Gly 
555 

Cys Ala Leu Glu 
570 

Ser Gin Gin His 
585 

Pro Ala Met Thr 

Pro Gly Met He 
620 

Met Glu Met Ala 
635 

Val Pro Gly Pro 
650 

Pro Arg Asp Gin 
665 

Val Glu Met Ser 

Phe Thr -He -Tyr 
. 700 

Ala Cys Thr Ser 
715 

Gin Gin His Ser 
730 

Asn Pro Thr Ser 
745 

Ala Pro Val Pro 

Asn Thr Ala Phe 
780 

Glu Glu He Phe 
795 

Asp Gin Val Val 
810 

Ser Leu Gin Leu 
825 

Pro Met Thr Val 

Ser Gin Cys Leu 
860 

Asp Gly Cys Arg 
875 

Pro Ala Pro Glu 
890 

Gly Gly Thr Leu 
905 

Ser Asp Val Ala 

Leu Pro Asp Arg 
940 

Pro Ala Pro Gly 
955 



525 








Met 


Ala 


Arg 


Val 


Asp 


Cys 


Val 


Gly 








560 


Thr 


Arg 


Cys 


Thr 






575 




Phe 


Trp 


Thr 


Ser 




590 






Val 


Leu 


Pro 


Ser 


605 








Leu 


Gin 


He 


Ser 


Cys 


Asp 


Tyr 


Gly 








640 


Ala 


Phe 


Gly 


His 






655 




Phe 


Pro 


Pro 


Phe 




670 






Val 


Arg 


Val 


Asn 


685 


-Cys' 






Asp 


-Ser 


Arg 


Cys 


Leu 


Ser 


Ala 






720 


Cys 


Val 


Ser 


Asn 




735 




Pro 


Gin 


Asp 


Cys 




750 






Thr 


Gly 


Gly 


Ser 


765 








Phe 


Gin 


Gly 


Ala 


Glu 


Ala 


Val 


Trp 








800 


Leu 


His 


Thr 


Thr 






815 




Lys 


Gly 


Arg 


Pro 




830 






Met 


Val 


Tyr 


Asn 


845 








Gly 


Arg 


Glu 


Asp 


Leu 


Arg 


Gly 


Pro 








880 


He 


Arg 


Ala 


He 






895 




Leu 


Thr 


He 


Arg 




910 






His 


Gly 


Val 


Trp 


925 








Tyr 


Thr 


Val 


Ser 


Pro 


Leu 


Ser 


Gly 








960 
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Val 


Val Thr Val 


Asn 


Ala 


Ser Lys Glu 


Gly Lys 


Ser Arg 


Asp 


Arg Phe 






965 






970 






975 


Ser 


Tyr Val Leu 


Pro 


Leu 


Val His Ser 


Leu Glu 


Pro Thr 


Met 


Gly Pro 




980 






985 






990 




Lys 


Ala Gly Gly 


Thr 


Arg 


He Thx lie 


His Gly 


Asn Asp 


Leu 


His Val 




995 






1000 




1005 




Gly 


Ser Glu Leu 


Gin 


Val 


Leu Val Asn 


Asp Thr 


Asp Pro 


Cys 


Thr Glu 




1010 






1015 




1020 






Leu 


Met Arg Thr 


Asp 


Thr 


Ser He Ala 


Cys Thr 


Met Pro 


Glu 


Gly Ala 


1025 




1030 


1035 




1040 


Leu 


Pro Ala Pro 


Val 


Pro 


Val Cys Val 


Arg Phe 


Glu Arg 


Arg 


Gly Cys 






1045 




1050 






1055 


Val 


His Gly Asn 


Leu 


Thr 


Phe Trp Tyr 


Met Gin 


Asn Pro 


Val 


He Thr 




1060 




1065 




1070 . 


Ala 


lie Ser Pro 


Arg 


Arg 


Ser Pro Val 


Ser Gly 


Gly Arg 


Thr 


He Thr 




1075 






1080 




1085 




Val 


Ala Gly Glu 


Arc 


Phe 


His Met Val 


Gin Asn 


Val Ser 


Met 


Ala Val 




1090 






1095 




1100 






His 


His lie Gly 


Arg 


Glu 


Pro Thr Leu 


Cys Lys 


Val Leu 


Asn 


Ser Thr 


1101 




1110 


1115 




1120 


Leu 


lie Thr Cys 


Pro 


Ser 


Pro Gly Ala 


Leu Ser 


Asn Ala 


Ser 


Ala Pro 




1125 




1130 






1135 


Val 


Asp Phe Phe 


He 


Asn Gly Arg Ala 


Tyr Ala 


Asp Glu 


Val 


Ala Val 




1140 




1145 




1150 


Ala 


Glu Glu Leu 


Leu 


Asp 


Pro Glu Glu 


Ala Gin 


Arg Gly 


Ser 


Arg Phe 




1155 






1160 




1165 




Arg Leu Asp Tyr 


Leu 


Pro 


Asn Pro Gin 


Phe Ser 


Thr Ala 


Lys 


Arg Glu 




1170 






1175 




1180 






Lys 


Trp lie Lys 


His 


His 


Pro Gly Glu 


Pro Leu 


Thr Leu 


Val 


He His 


118 5 




1190 




1195 




1200 


Lys 


Glu Gin Asp 


Ser 


Leu Gly Leu Gin 


Ser His 


Glu Tyr Arg Val Lys 




1205 






1210 






1215 



He Gly Gin Val Ser Cys Asp lie Gin He Val Ser Asp Arg He He 

1220 1225 1230 

His Cys Ser Val Asn Glu Ser Leu Gly Ala Ala Val Gly Gin Leu Pro 

1235 1240 1245 

He Thr He Gin Val Gly Asn Phe Asn Gin Thr He Ala Thr Leu Gin 

1250 * 1255 1260 

Leu Gly Gly Ser Glu Thr Ala He He Val Ser He Val He Cys Ser 
1265 1270 " 1275 1280 

Val Leu Leu Leu Leu Ser Val Val Ala Leu Phe Val Phe Cys Thr Lys 

1285 1290 1295 

Ser Arg Arg Ala Glu Arg Tyr Trp Gin Lys Thr Leu Leu Gin Met Glu 

1300 1305 1310 

Glu Met Glu Ser Gin lie Arg Glu Glu lie Arg Lys Gly Phe Ala Glu 

1315 1320 1325 

Leu Gin Thr Asp Met Thr Asp Leu Thr Lys Glu Leu Asn Arg Ser Gin 

1330 1335 1340 

Gly He Pro Phe Leu Glu Tyr Lys His Phe Val Thr Arg Thr Phe Phe 
1345 1350 1355 1360 

Pro Lys Cys Ser Ser Leu Tyr Glu Glu Arg Tyr Val Leu Pro Ser Gin 

1365 1370 1375 

Thr Leu Asn Ser Gin Gly Ser Ser Gin Ala Gin Glu Thr His Pro Leu 

1380 1385 1390 

Leu Gly Glu Trp Lys He Pro Glu Ser Cys Arg Pro Asn Met Glu Glu 
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1395 1400 1405 

Gly lie Ser Val Phe Ser Ser Leu Leu Asn Asn Lys His Phe Leu lie 

1410 1415 1420 

Val Phe Val His Ala Leu Glu Gin Gin Lys Asp Phe Ala Val Arg Asp 
1425 1430 1435 1440 

Arg Cys Ser Leu Ala Ser Leu Leu Thr lie Ala Leu His Gly Lys Leu 

1445 1450 1455 

Glu Tyr Tyr Thr Ser lie Met Lys Glu Leu Leu Val Asp Leu lie Asp 

1460 1465 1470 

Ala Ser Ala Ala Lys Asn Pro Lys Leu Mec Leu Arg Arg Thr Glu Ser 

1475 1480 1485 

Val Val Glu Lys Met Leu Thr Asn Trp Met Ser lie Cys Met Tyr Ser 

1490 1495 1500 

Cys Leu Arg Glu Thr Val Gly Glu Pro Phe Phe Leu Leu Leu Cys Ala 
1505 1510 1515 1520 

lie Lys Gin Gin lie Asn Lys Gly Ser He Asp Ala He Thr Gly Lys 

1525 1530 1535 

Ala Arg Tyr Thr Leu Asn Glu Glu Trp Leu Leu Arg Glu Asn He Glu 

1540 1545 1550 

Ala Lys Pro Arg Asn Leu Asn Val Ser Phe Gin Gly Cys Gly Met Asp 

1555 1560 1565 

Ser Leu Ser Val .Aug Ala JMe.t Asp Thr Asp JThr Leu Thr .Gin JVal JLys 

1570 1575 1580 

Glu Lys He Leu Glu Ala Phe Cys Lys Asn Val Pro Tyr Ser Gin Trp 
1585 1590 1595 1600 

Pro Arg Ala Glu Asp Val Asp Leu Glu Trp Phe Ala Ser Ser Thr Gin 

1605 1610 1615 

Ser Tyr He Leu Arg Asp Leu Asp Asp Thr Ser Val Val Glu Asp Gly 

1620 1625 1630 

Arg Lys Lys Leu Asn Thr Leu Ala His Tyr Lys He Pro Glu Gly Ala 

1635 1640 1645 

Ser Leu Ala Met Ser Leu He Asp Lys Lys Asp Asn Thr Leu Gly Arg 

1650 1655 1660 

Val Lys Asp Leu Asp Thr Glu Lys Tyr Phe His Leu Val Leu Pro Thr 
1665 1670 1675 1680 

Asp Glu Leu Ala Glu Pro Lys Lys Ser His Arg Gin Ser His Arg Lys 

1685 1690 1695 

Lys Val Leu Pro Glu He Tyr Leu Thr Arg Leu Leu Ser Thr Lys Gly 

1700 1705 1710 

Thr Leu Gin Lys Phe Leu Asp Asp Leu Phe Lys Ala He Leu Ser He 

1715 1720 1725 

Arg Glu Asp Lys Pro Pro Leu Ala Val Lys Tyr Phe Phe Asp Phe Leu 

1730 1735 1740 

Glu Glu Gin Ala Glu Lys Arg Gly He Ser Asp Pro Asp Thr Leu His 
1745 1750 1755 1760 

He Trp Lys Thr Asn Ser Leu Pro Leu Arg Phe Trp Val Asn He Leu 

1765 1770 1775 

Lys Asn Pro Gin Phe Val Phe Asp He Asp Lys Thr Asp His He Asp 

1780 1785 1790 

Ala Cys Leu Ser Val He Ala Gin Ala Phe He Asp Ala Cys Ser He 

1795 1800 1805 

Ser Asp Leu Gin Leu Gly Lys Asp Ser Pro Thr Asn Lys Leu Leu Tyr 

1810 1815 1820 

Ala Lys Glu He Pro Glu Tyr Arg Lys He Val Gin Arg Tyr Tyr Lys 
1825 1830 1835 1840 
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Gin He Gin Asp Met Thr Pro Leu Ser Glu Gin Glu Met Asn Ala His 

1845 1850 1855 

Leu Ala Glu Glu Ser Arg Lys Tyr Gin Asn Glu Phe Asn Thr Asn Val 

1860 1865 1870 

Ala Met Ala Glu lie Phe Arg Ser Pro Lys Arg Tyr Arg Pro Gin He 

1875 1880 1885 

Met Ala Ala Leu 1 Glu Ala Asn Pro Thr Ala Arg Arg Thr Gin Leu Gin 

1890 1895 19a0 

His Lys Phe Glu Gin Val Val Ala Leu Met Glu Asp. Asn lie Tyr Glu 
1905 1910 1915 1920 

Cys Tyr Ser Glu Ala 
1925 

<210> 7 

<211> 601 

<212> DNA 

<213> HOMO SAPIEN 



<400> 7 

caccagagtc 

aggcgacccc 

-t gage gg tee 

ggtccatccc 

caatgtcccg 

tgggctggtc 

gatcatcaca 

cggcatgacc 

cctgtcctgc 

ccatgacccc 

c 



cctgtggagt 
cactgtggct 
-aaggage-eec 
aacaatatct 
gagctgtcag 
gtgggcaatc 
gagaatgggg 
ttcgccagca 
gtggagagtc 
aagacctget 



cctgtggtca 
ggtgtgtgct 
-geagg-t-tfcge 
ccgtctctca 
ctggcgtcaa 
agatccagtg 
accaccatgt 
ccagctttgt 
cataccgctg 
ccttccagga 



gtatcagagc 
gcacaacact 
-efeeggagatg 
gtacaacgcg 
ctgcaccttt 
ctactcccct 
egtacagett 
cttctacaat 
ccactggtgt 
aggecgagtg 



tgcggcgagt 
tgcacccgga 
aageagtgtg 
ctgctggtcc 
gaggacctgt 
geagecaagg 
cagctcaaat 
tgcagcgtcc 
aaatacegge 
aagctgcccg 



gccttggctc 
aggageggtg 
-teeggctgae 
tggagacgta 
cagagatgga 
aggtgccccg 
caaaggagac 
acaattegtg 
atgtctgcac 
aggtaggtcc 



60 
120 

. iao 

240 
300 
360 
420 
480 
540 
600 
601 



<210> 8 

<211> 199 

<212> PRT 

<213> HOMO SAPIEN 



<400> 8 






























Thr 


Arg 


Val 


Pro 


Val 


Glu 


Ser 


Cys 


Gly 


Gin 


Tyr 


Gin 


Ser 


Cys 


Gly 


Glu 


1 








5 










10 










15 




Cys 


Leu 


Gly 


Ser 
20 


Gly 


Asp 


Pro 


His 


Cys 
25 


Gly 


Trp 


Cys 


Val 


Leu 
30 


His 


Asn 


Thr 


Cys 


Thr 


Arg 


Lys 


Glu 


Arg 


Cys 


Glu 


Arg 


Ser Lys 


Glu 


Pro 


Arg 


Arg 






35 










40 










45 








Phe 


Ala 


Ser 


Glu 


Met 


Lys 


Gin 


Cys 


Val 


Arg 


Leu 


Thr 


Val 


His 


Pro 


Asn 




50 








55 








60 










Asn 


He 


Ser 


Val 


Ser 


Gin 


Tyr 


Asn 


Ala 


Leu 


Leu 


Val 


Leu 


Glu 


Thr 


Tyr 


65 










70 










75 










80 


Asn 


Val 


Pro 


Glu 


Leu 


Ser 


Ala 


Gly 


Val 


Asn 


Cys 


Thr 


Phe 


Glu 


Asp 


Leu 










85 








90 








95 




Ser 


Glu 


Met 


Asp 


Gly 


Leu 


Val 


Val 


Gly 


Asn 


Gin 


He 


Gin 


Cys 


Tyr 


Ser 








100 








105 










110 






Pro 


Ala 


Ala 
115 


Lys 


Glu 


Val 


Pro 


Arg 
120 


He 


He 


Thr 


Glu 


Asn 
125 


Gly 


.Asp 


His 


His 


Val 
130 


Val 


Gin 


Leu 


Gin 


Leu 
135 


Lys 


Ser 


Lys 


Glu 


Thr 
140 


Gly 


Met 


Thr 


Phe 
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Ala Ser Thr Ser Phe Val Phe Tyr Asn Cys Ser Val His Asn Ser Cys 
145 150 155 160 

Leu Ser Cys Val Glu Ser Pro Tyr Arg Cys His Trp Cys Lys Tyr Arg 

165 170 175 

His Val Cys Thr Asp Pro Lys Thr Cys Ser Phe Gin Glu Gly Arg Val 

180 185 190 

Lys Leu Pro Glu Val Gly Pro 
195 

<210> 9 

<211> 6408 

<212> DNA 

<213> HOMO SAPIEN 



<400> 9 

atgcctgctc 

cagccccttc 

gaccccacct 

gggctgcagc 

ccacctgtga 

ctcctggtga 

Gagcggcgcc 

caatacgtgg 

gcaggggagc 

attccaccca 

gaggagacag 

gcctttgcac 

tctagagctt 

tatgtggagt 

gtggccacgt 

gcacccccca 

ctctgtgcct 

tacacccggg 

aattctgact 

cacacgccca 

ccagggattc 

ctgggtgata 

ccatactcca 

gggacctttg 

tcctgtgctc 

tggtgcgtgc 

gagcagtggc 

cctgccaaca 

cccctgcggc 

ctgactggtt 

agaggagccg 

aaaacttccc 

tgccaggcct 

acccacaagg 

tccccagacc 

ctggccaccc 

gcttcggaca 

ggttctggct 

cctcccagcc 

tcagccacac 



tgggcccagc 
caccaactgc 
caggcaccct 
tggaggccac 
cgcctgatga 
gcccaggggc 
tggggcagc-t 
ctgccaatga 
ccctcctgtt 
tcacaacccg 
ccaagctggc 
gtggggccag 
ttcgtgccta 
tgcctctggc 
ccagggaggt 
ctgtgggccg 
tccccctgga 
agggtcgcgc 
gtgcacagct 
gccccatggc 
agctaacagc 
gtcaagggca 
cacagagcat 
agcacctgta 
agcacctgga 
cccttggcag 
tatggagctt 
tcagccgaga 
caggggagcc 
ctggtgtgat 
actacgtatc 
tctctttcta 
gtgtgagcag 
cctcgtgtga 
ctcctgcaag 
ctgctcctga 
tctcacctgg 
ccatatcttc 
cccaaaatgg 
ctgaggacct 



tcttctccag 
attcactccc 
ctacctgggg 
agtgtccacc 
gtgcccccag 
cctggtggta 
-egagcagctg 
tcctgcggtc 
tgtggggcga 
ggccctgtgg 
agtgggccgc 
cgcctacttc 
tgtatctcga 
ctgcgaaggt 
ggcgcatggg 
gcccccatcg 
tgaggtggac 
tgaggatggg 
gccagtggac 
cagccgggtc 
tgtggcagtc 
gctgcacagg 
ccagcagggg 
tgtcatgacc 
ctgtgcatct 
gtgcagtcgc 
ccagcctgag 
ggagacgagg 
atattcctgc 
gtgcccctcc 
cgtgagcgtg 
tgactgtgtg 
ccgctggggg 
tgctgggccc 
aggtggaccc 
cacccttccc 
ggc tag tcct 
ccctggctcc 
acctggaacc 
cttggcctcc 



gctctctggg 
aatggcacgt 
gccaccaact 
ggccctgtgc 
gcccagccta 
cgcgggagcg 
-ctgctgcggc 
agcacggtgg 
ggatacacca 
ccgcccgacc 
ccctccgagt 
ctgttcctgc 
gtgtgtctcc 
ggccgccacg 
gaggtgctct 
gcggctgctg 
cggcttgcta 
accgaggtgg 
accccggatg 
ccgctggaag 
accatggaag 
gtctacttgg 
tctgcagtga 
cagagcacac 
tgccttgctc 
cgttctgagt 
ctgggcngtc 
gaggttttcc 
cactttgggg 
ccagacccta 
gagctcagat 
gcggtcactg 
tgtaactggt 
atggttgcaa 
agcccctccc 
gtggagcctg 
tccctgctca 
acagggccgc 
gctgtccctg 
ccgctgtcac 



ccgggtgggt 
atctgcagca 
tcctgt ticca 
tagacagcag 
ccaacaaccc 
tgcaccaggg 
-cagagcggcc 
ggctggtagc 
gcaggggtgt 
cccaagctgc 
acagccacca 
ggcgggacct 
gggaccagca 
ggctgatcca 
ttgcagccct 
gggcatctgg 
atcgcacgcg 
cctacatcga 
cttatccctg 
ccacaccaat 
atggacacac 
gcccagggag 
gcagagacct 
ttctgaaggt 
acagggaccc 
gctcgagggg 
tgcaagtggc 
tatcagtgcc 
aacatcagag 
gtgaggcccc 
ttggcgctgt 
aactccgccc 
gtgtctggca 
gccatcagag 
cacccacagc 
gggctccctc 
gcccctgggg 
ctctccatga 
cccccactga 
cgtcagaggt 



cctcaccctc 
cctggcaagg 
gctgagccct 
ggaccgcctg 
gaatcagctg 
ggtccgtgaa 
-tggggacaca 
ccagggcttg 
ggggggtggc 
cttctcctat 
ctccgtgagt 
gcaggctcag 
ctactactcc 
ggctgcagct 
ctcctcggct 
agcctctgcc 
agatgcctgc 
gtatgatgtc 
tggctcagac 
tctggagtgg 
catcgctttc 
cgatggccac 
cacctttgat 
tcctgtggct 
atactgtggg 
ccagggccca 
agccatgagt 
agacctgcca 
tcctgccctg 
agtgctgccg 
tgtgatcgcc 
atctgcgcag 
gcacctgtgc 
cccgcttgtc 
ccccaaagcc 
cacagccaca 
gccatgggca 
ggagccctcc 
cttcagaccc 
agcagcagtg 



60 
120 
180 
240 
300 
360 
4-20 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
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ccccctgcag accctggccc cgaggctctt catcccacag tgcccctgga cctgccccct 2460 

gccactgttc ctgccaccac tttcccaggg gccatgggct ccgtgaagcc cgccctggac 2520 

tggctcacga gagaaggcgg cgagctgccc gaggcggacg agtggacggg gggtgacgca 2580 

cccgccttct ccacttccac cctcctctca ggtgatggag actcagcaga gcttgagggc 2640 

ccccccgccc ccctcatcct cccgtccagc ctcgactacc agtatgacac ccccgggctc 2700 ■ 

tgggagctgg aagaggcgac cttgggggca agctcctgcc cctgtgtgga gagcgttcag 2760 

ggctccacgc tgatgccggt ccatgtggag cgggaaaccc ggctgctagg caggaacctg 2820 

caccttttcc aggatggccc aggagacaat gagtgtgtga tggagctgga gggcctcgag 2880 

gtggtggttg aggcccgggt cgagcgtgag ccacctccag atacccagtg ccatgtcacc 2940 

tgccagcagc accagctcag cratgaggct ctgcagccgg agctccgtgt ggggctgttt 3000 

ccgcgccggg ccggccgtct gcgtgtggac agtgctgagg ggctgcatgt ggtactgtat 3060 

gaccgttccg tgggacatgg agactgcagc cgccgccaaa ctgccatgcc ccagtatggc 3120 

tgcgcgtggc gtgaggggga gcgtccacgc tgtgtgaccc - gggaggcctg tggtgaggct 3180 

gaggctgtgg ccacccagcg cccagcgccc ctcatccact cggtggagcc actgactggg 3240 

ccrgtagacg gaggcacccg tgtcaccatc aggggctcca acctgggcca gcatgtgcag 3300 

gacgtgctgg gcatggtcac ggtggctgga gtgccctgtg ccgtggatgc ccaggagtac 3360 

gaggtctcca gcagcctcgt gtgcatcacc ggggccagcg gggaggaggt ggccggcgcc 3420 

acagcggtgg aggcgccggg aagaggacgt ggtgtctcag aacacgactt tgcctaccag 3480 

gacccgaagg tccattccat cttcccggcc cgcggcccca gagctggggg cacccgtctc 3540 

accctgaatg gctccaagct cctgactggg cggctggagg acatccgagt ggtggttgga 3600 

gaccagccct gtcacttgct gcccgagcag cagtcagaac aactgcggtg tgagaccagc 3660 

-ecacgeccca egcetgceac gc-tce-ctgtg -gc-tgfcg-tggt -ttggggcoaG ^gagcggagg -3-7-2-0 

ctncaacgcg gacagttcaa gtataccttg gaccccaaca tcacctctgc tggccccacc 3780 

aagagcttcc tcagtggagg acgtgagata tgcgtccgtg gccagaatct ggacgtggta 3840 

cagacgccaa gaacccgggt gaccgtggtc tcgagaatgc tgcagcccaa ccaggggctt 3900 

ggacggaggc gtcgcgtggt cccggagacg gcatgttccc ttggaccctc ctgcagtagc 3960 

cagcaatttg aggagccgtg ccatgtcaac tcctcccagc tcatcacgtg ccgcacacct 402 0 

gccctcccag gcctgcctga ggacccctgg gtccgggtgg aatttatcct tgacaacctg 4080 

gtcrttgacc ttgcaacact gaaccccaca cctttctcct atgaggccga ccccaccctg 4140 

cagccactca accctgagga ccccaccacg ccattccggc acaagcctgg gagtgtgttc 4200 

tccgtggagg gggagaacct ggaccttgca atgtccaagg aggaggtggt ggctangata 4260 

ggggatggcc cctgtgtggt gaagacgctg acgcggcacc acctgtactg cgagcccccc 432 0 

gtggagcagc ccctgccacg gcaccatgcc ctccgagagg cacctgactc tttgcctgag 4380 

ttcacggtgc agatggggaa cttgcgcttc tccctgggtc acgtgcagta tgacggcgag 4440 

agccctgggg cttttcctgt ggcagcccag gtgggcttgg gggtgggcac ctctcttctg 4500 

gctctgggtg tcatcatcat tgtcctcatg tacaggagga agagcaagca ggccctgagg 4560 

gactataaga aggttcagat ccagctggag aatctggaga gcagtgtgcg ggaccgctgc 4620 

aagaaggaat: tcacagacct catgactgag atgaccgatc tcaccagtga cctcctgggc 4680 

agcggcatcc ccttcctcga ctacaaggtg tatgcggaga ggatcttctt ccctgggcac 474 0 

cgcgagtcgc ccttgcaccg ggacctgggt gtgcctgaga gcagacggcc cactgtagag 4800 

caagggctgg ggcagctctc taacctgctc aacagcaagc tcttcctcac caagttcatc 4860 

cacacgctgg agacccagcg caccttttca gctcgggacc gtgcctacgt ggcatctctg 492 0 

ctcaccgtgg cactgcatgg gaagcttgag tatttcactg acatcctccg cactctgctc 4980 

agcgacctgg ttgcccagta tgtggccaag aaccccaagc tgatgctgcg caggacagag 504 0 

acrgtggtgg agaagctgct caccaactgg atgtccatct gtctgtatac cttcgtgagg 5100 

gacrccgtag gggagcctct gtacatgctc tttcgaggga ttaagcacca agtggataag 5160 

ggcccagtgg acagtgtgac aggcaaggcc aaatacacct tgaacgacaa ccgcctgctc 522 0 

agagaggatg tggagtaccg tcccctgacc ttgaatgcac tattggctgt ggggcctggg 5280 

gcaggagagg cccagggcgt gcccgtgaag gtcctagact gtgacaccat ctcccaggca 5340 

aacgagaaga tgctggacca gctttataaa ggagtgcctc tcacccagcg gccagaccet 5400 

cgcacccttg atgttgagtg gcggtctggg gtggccgggc acctcattct ttctgacgag 5460 

gacgtcactc ctgaggtcca gggtctgtgg aggcgcctga acacactgca gcattacaag 5520 

gtcccagatg gagcaactgt ggccctcgtc ccctgcctca ccaagcatgt gctccgggaa 5580 

aaccaggatc atgtccctgg agagcggacc ccaatgctgg aggatgtaga tgaggggggc 5640 

atccggcccc ggcacctggt gaagccaagt gatgagccgg agccgcccag gcctcggagg 5700 
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ggcagccttc 
cgcctgctgt 
ctcagcacca 
caggcccagc 
ttgcctctga 
acatctgata 
accctggccg 
gacattcccc 
ccagccagcg 
gacctcgggg 
cagatcatca 
ctccagcaga 



ggggcgggga 
ccatgaaggg 
gccgccccgt 
agcatggcat 
ggttctggat 
acatggatgc 
accacaagct 
ggtacaagcg 
accaagagat 
cgcgagtggc 
ctgccctgga 
ttgcagctgc 



gcgtgagcgc 
caccctgcag 
gccgctcgct 
ctccgaccag 
caatataata 
ggtgctcctt 
gggccgggac 
gatggtggaa 
gaaccctgtc 
cctgcatgaa 
ggaggatggc 
tgtggaaaac 



gccaaggcca 
aagttcgtgg 
gtgaagtact 
gacaccatcc 
aaaaacccgc 
gtcattgcac 
tccccgatca 
aggtactatg 
ctggctgaac 
ctctacaagt 
acggcccaga 
aaggtcacag 



tccctgagat 
atgacctgtt 
tctttgacct 
acatctggaa 
agtttgtgtt 
agaccttcat 
acaaacttct 
cagacatcag 
tgtcctggaa 
acatcaacaa 
agatgcagct 
atccatag 



ctacctgacc 
ccaggtgatt 
gctggatgag 
gaccaacagc 
cgacgtgcaa 
ggacgcctgc 
gtatgcacgg 
acagactgtc 
ccactccgga 
gtactatgac 
gggctatcgg 



5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
624 0 
6300 
6360 
6408 



<210> 10 

<211> 2135 

<212> PRT 

<213> HOMO SAPIEN 

„Met°Pro°Ala Leu _Gly Pro Ala Le.u Le.u _Gln Ala Leu Trp Ala _G|y Trp 

! 5 10 15 

Val Leu Thr Leu Gin Pro Leu Pro Pro Thr Ala Phe Thr Pro Asn Gly 

20 25 30 

Thr Tyr Leu Gin His Leu Ala Arg Asp Pro Thr Ser Gly Thr Leu Tyr 

35 40 45 

Leu Gly Ala Thr Asn Phe Leu Phe Gin Leu Ser Pro Gly Leu Gin Leu 

50 55 60 

Glu Ala Thr Val Ser Thr Gly Pro Val Leu Asp Ser Arg Asp Cys Leu 
65 70 75 80 

Pro Pro Val Met Pro Asp Glu Cys Pro Gin Ala Gin Pro Thr Asn Asn 

85 90 95 

Pro Asn Gin Leu Leu Leu Val Ser Pro Gly Ala Leu Val Val Cys Gly 

100 105 11° 

Ser Val His Gin Gly Val Cys Glu Gin Arg Arg Leu Gly Gin Leu Glu 

115 120 125 

Gin Leu Leu Leu Arg Pro Glu Arg Pro Gly Asp Thr Gin Tyr Val Ala 

130 135 140 

Ala Asn Asp Pro Ala Val Ser Thr Val Gly Leu Val Ala Gin Gly Leu 
145 150 155 160 

Ala Gly Glu Pro Leu Leu Phe Val Gly Arg Gly Tyr Thr Ser Arg Gly 

165 170 1 ?5 

Val Gly Gly Gly He Pro Pro He Thr Thr Arg Ala Leu Trp Pro Pro 

180 185 190 

Asp Pro Gin Ala Ala Phe Ser Tyr Glu Glu Thr Ala Lys Leu Ala Val 

195 200 205 

Gly Arg Leu Ser Glu Tyr Ser His His Phe Val Ser Ala Phe Ala Arg 

210 215 220 

Gly Ala Ser Ala Tyr Phe Leu Phe Leu Arg Arg Asp Leu Gin Ala Gin 
225 230 235 240 

Ser Arg Ala Phe Arg Ala Tyr Val Ser Arg Val Cys Leu Arg Asp Gin 

245 250 255 

His Tyr Tyr Ser Tyr Val Glu Leu Pro Leu Ala Cys Glu Gly Gly Arg 
260 265 270 
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Tyr Gly Leu lie Gin Ala Ala Ala Val Ala Thr Ser Arg Glu Val Ala 
275 280 285 

- His Gly Glu Val Leu Phe Ala Ala Phe Ser Ser Ala Ala Pro Pro Thr 
290 295 300 

Val Gly Arg Pro' Pro Ser Ala Ala Ala Gly Ala Ser Gly Ala Ser Ala 
305 310 315 320 

Leu Cys Ala Phe Pro Lieu Asp Glu Val Asp Arg Leu Ala Asn Arg Thr 

325 330 ■ 335 

Arg Asp Ala Cys Tyr Thr Arg Glu Gly Arg Ala Glu Asp Gly Thr Glu 

340 345 350 

Val Ala Tyr lie Glu Tyr Asp Val Asn Ser Asp Cys Ala Gin Leu Pro 

355 360 365 

Val Asp Thr Leu Asp Ala Tyr Pro Cys Gly Ser Asp His Thr Pro Ser 

370 375 380 

Pro Met Ala Ser Arg Val Pro Leu Glu Ala Thr Pro lie Leu Glu Trp 
385 390 395 400 

Pro Gly lie Gin Leu Thr Ala Val Ala Val Thr Met Glu Asp Gly His 

. 405 410 415 

Thr lie Ala Phe Leu Gly Asp Ser Gin Gly Gin Leu His Arg Val Tyr 

420 425 430 

Leu Gly Pro Gly Ser Asp Gly His Pro Tyr Ser Thr Gin Ser lie Gin 

435 440 445 

"Gin .Gly Ser Ala Val Ser Arg Asp Leu Thr Phe "Asp Gly Thr ~Phe Glu ~ 

450 455 . 460 

His Leu Tyr Val Met Thr Gin Ser Thr Leu Leu Lys Val Pro Val Ala 
465 470 475 480 

Ser Cys Ala Gin His Leu Asp Cys Ala Ser Cys Leu Ala His Arg Asp 

485 490 495 

Pro Tyr Cys Gly Trp Cys Val Leu Leu Gly Arg Cys Ser Arg Arg Ser 

500 505 510 

Glu Cys Ser Arg Gly Gin Gly Pro Glu Gin Trp Leu Trp Ser Phe Gin 

515 . 520 525 

Pro Glu Leu Gly Cys Leu Gin Val Ala Ala Met Ser Pro Ala Asn lie 

530 535 540 

Ser Arg Glu Glu Thr Arg Glu Val Phe Leu Ser Val Pro Asp Leu Pro 
545 550 555 560 

Pro Leu Trp Pro Gly Glu Ser Tyr Ser Cys His Phe Gly Glu His Gin 

565 570 575 

Ser Pro Ala Leu Leu Thr Gly Ser Gly Val Met Cys Pro Ser Pro Asp 

580 585 590 

Pro Ser Glu Ala Pro Val Leu Pro Arg Gly Ala Asp Tyr Val Ser Val 

595 600 605 

Ser Val Glu Leu Arg Phe Gly Ala Val Val lie Ala Lys Thr Ser Leu 

610 615 620 

Ser Phe Tyr Asp Cys Val Ala Val Thr Glu Leu Arg Pro Ser Ala Gin 
625 630 635 640 

Cys Gin Ala Cys Val Ser Ser Arg Trp Gly Cys Asn Trp Cys Val Trp 

645 650 655 

Gin His Leu Cys Thr His Lys Ala Ser Cys Asp Ala Gly Pro Met Val 

660 665 670 

Ala Ser His Gin Ser Pro Leu Val Ser Pro Asp Pro Pro Ala Arg Gly 

675 680 685 

Gly Pro Ser Pro Ser Pro Pro Thr Ala Pro Lys Ala Leu Ala Thr Pro 

690 695 700 

Ala Pro Asp Thr Leu Pro Val Glu Pro Gly Ala Pro Ser Thr Ala Thr 



- 24 - 



WO 01/14420 



PCT/US00/23365 



705 710 715 720 

Ala Ser Asp lie Ser Pro Gly Ala Ser Pro Ser Leu Leu Ser Pro Trp 

725 730 735 

Gly Pro Trp Ala Gly Ser Gly Ser lie Ser Ser Pro Gly Ser Thr Gly 

740 745 750 

Ser Pro Leu His Glu Glu Pro Ser Pro Pro Ser Pro Gin Asn Gly Pro 

755 760 765 

Gly Thr Ala Val Pro Ala Pro Thr Asp Phe Arg Pro Ser "Ala Thr Pro 

770 775 780 

Glu Asp Leu Leu Ala Ser Pro Leu Ser Pro Ser Glu Val Ala Ala Val 
785 790 795 800 

Pro Pro Ala Asp Pro Gly Pro Glu Ala Leu His Pro Thr Val Pro Leu 

805 810 815 

Asp Leu Pro Pro Ala Thr Val Pro Ala Thr Thr Phe Pro Gly Ala Met 

820 825 830 

Gly Ser Val Lys Pro Ala Leu Asp Trp Leu Thr Arg Glu Gly Gly Glu 

835 840 845 

Leu Pro Glu Ala Asp Glu Trp Thr Gly Gly Asp Ala Pro Ala Phe Ser 

850 855 860 
Thr Ser Thr Leu Leu Ser Gly Asp Gly Asp Ser Ala Glu Leu Glu Gly 
865 870 875 880 
Pro Pro Ala Pro Leu lie Leu Pro Ser Ser Leu Asp Tyr Gin Tyr Asp 
— ^ 8 - 90 - - 895 

Thr Pro Gly Leu Trp Glu Leu Glu Glu Ala Thr Leu Gly Ala Ser Ser 

900 905 910 

Cys Pro Cys Val Glu Ser Val Gin Gly Ser Thr Leu Met Pro Val His 

915 920 925 

Val Glu Arg Glu lie Arg Leu Leu Gly Arg Asn Leu Hjls Leu Phe Gin 

930 ~ 935 940 

Asp Gly Pro Gly Asp Asn Glu Cys Val Met Glu Leu Glu Gly Leu Glu 
945 950 955 960 

Val Val Val Glu Ala Arg Val Glu Cys Glu Pro Pro Pro Asp Thr Gin 

9 65 ~ 970 975 

Cys His Val Thr Cys Gin Gin His Gin Leu Ser Tyr Glu Ala Leu Gin 

980 985 990 

Pro Glu Leu Arg Val Gly Leu Phe Leu Arg Arg Ala Gly Arg Leu Arg 

995 1000 1005 

Val Asp Ser Ala Glu Gly Leu His Val Val Leu Tyr Asp Cys Ser Val 

1010 1015 1020 

Glv His Gly Asp Cys Ser Arg Cys Gin Thr Ala Met Pro Gin Tyr Gly 
1025 1030 1035 1040 

Cys Val Trp Cys Glu Gly Glu Arg Pro Arg Cys Val Thr Arg Glu Ala 

1045 1050 1055 

Cys Gly Glu Ala Glu Ala Val Ala Thr Gin Cys Pro Ala Pro Leu He 

1060 1065 1070 

His Ser Val Glu Pro Leu Thr Gly Pro Val Asp Gly Gly Thr Arg Val 

1075 1080 1085 

Thr He Arg Gly Ser Asn Leu Gly Gin His Val Gin Asp Val Leu Gly 

1090 1095 HOO 

Met Val Thr Val Ala Gly Val Pro Cys Ala Val Asp Ala Gin Glu Tyr 
1105 HIO 1H5 1120 

Glu Val Ser Ser Ser Leu Val Cys He Thr Gly Ala Ser Gly Glu Glu 

1125 H30 1135 

Val Ala Gly Ala Thr Ala Val Glu Val Pro Gly Arg Gly Arg Gly Val 
1140 H45 H50 
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Ser Glu His Asp Phe Ala Tyr Gin Asp Pro Lys Val His Ser lie Phe 

1155 1160 1165 

Pro Ala Arg Gly Pro Arg Ala Gly Gly Thr Arg Leu Thr Leu Asn Gly 

1170 1175 1180 

Ser Lys Leu Leu Thr Gly Arg Leu Glu Asp lie Arg Val Val Val Gly 
1185 1190 1195 1200 

Asp Gin Pro Cys His Leu Leu Pro Glu Gin Gin. Ser Glu Gin Leu Arg ' 

1205 " 1210 1215' 

Cys Glu Thr Ser Pro Arg Pro Thr Pro Ala Thr Leu Pro Val Ala Val 

1220 1225 1230 

Trp Phe Gly Ala Thr Glu Arg Arg Leu Gin Arg Gly Gin Phe Lys Tyr 

1235 1240 1245 

Thr Leu Asp Pro Asn lie Thr Ser Ala Gly Pro Thr Lys Ser Phe Leu 

1250 1255 1260 

Ser Gly Gly Arg Glu lie Cys Val Arg Gly Gin Asn Leu Asp Val Val 
1265 1270 1275 1280 

Gin Thr Pro Arg lie Arg Val Thr Val Val Ser Arg Met Leu Gin Pro 

1285 1290 1295 

Ser Gin Gly Leu Gly Arg Arg Arg Arg Val Val Pro Glu Thr Ala Cys 

1300 1305 ' 1310 

Ser Leu Gly Pro Ser Cys Ser Ser Gin Gin Phe Glu Glu Pro Cys His 

-131-5 - - 132 0- - ~ 1325 - 

Val Asn Ser Ser Gin Leu lie Thr Cys Arg Thr Pro Ala Leu Pro Gly 

1330 1335 1340 

Leu Pro Glu Asp Pro Trp Val Arg Val Glu Phe lie Leu Asp Asn Leu 
1345 1350 1355 1360 

Val Phe Asp Phe Ala Thr Leu Asn Pro Thr Pro Phe Ser Tyr Glu Ala 

1365 1370 1375 

Asp Pro Thr Leu Gin Pro Leu Asn Pro Glu Asp Pro Thr Met Pro . Phe 

1380 1385 1390 

Arg His Lys Pro Gly Ser Val Phe Ser Val Glu Gly Glu Asn Leu Asp 

1395 1400 1405 

Leu Ala Met Ser Lys Glu Glu Val Val Ala Met lie Gly Asp Gly Pro 

1410 1415 1420 

Cys Val Val Lys Thr Leu Thr Arg His His Leu Tyr Cys Glu Pro Pro 
1425 1430 1435 1440 

Val Glu Gin Pro Leu Pro Arg His His Ala Leu Arg Glu Ala Pro Asp 

1445 1450 - 1455 

Ser Leu Pro Glu Phe Thr Val Gin Met Gly Asn Leu Arg Phe Ser Leu 

1460 1465 1470 

Gly His Val Gin Tyr Asp Gly Glu Ser Pro Gly Ala Phe Pro Val Ala 

. 1475 1480 1485 

Ala Gin Val Gly Leu Gly Val Gly Thr Ser Leu Leu Ala Leu Gly Val 

1490 1495 1500 

lie lie lie Val Leu Met Tyr Arg Arg Lys Ser Lys Gin Ala Leu Arg 
1505 1510 1515 1520 

Asp Tyr Lys Lys Val Gin lie Gin Leu Glu Asn Leu Glu Ser Ser Val 

1525 1530 1535 

Arg Asp Arg Cys Lys Lys Glu Phe Thr Asp Leu Met Thr Glu Met Thr 

1540 1545 1550 

Asp Leu Thr Ser Asp Leu Leu Gly Ser Gly lie Pro Phe Leu Asp Tyr 

1555 1560. 1565 

Lys Val Tyr Ala Glu Arg lie Phe Phe Pro Gly His Arg Glu Ser Pro 

1570 1575 1580 

Leu His Arg Asp Leu Gly Val Pro Glu Ser Arg Arg Pro Thr Val Glu 
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1585 1590 1595 1600 

Gin Gly Leu Gly Gin Leu Ser Asn Leu Leu Asn Ser Lys Leu Phe Leu 

1605 1610 1615 

Thr Lys Phe He His Thr Leu Glu Ser Gin Arg Thr Phe Ser Ala Arg 

1620 1625 1630 

Asp Arg Ala Tyr Val Ala Ser Leu Leu Thr Val Ala Leu His Gly Lys 

1635 1640 1645 

Leu Glu Tyr Phe Thr Asp He Leu Arg Thr Leu Leu Ser Asp Leu Val 

1650 1655 1660 

Ala Gin Tyr Val Ala Lys Asn Pro Lys Leu Met Leu Arg Arg Thr Glu 
1665 1670 1675 1680 

Thr Val Val Glu Lys Leu Leu Thr Asn Trp Met Ser He Cys Leu Tyr 

1685 1690 1695 

Thr Phe Val Arg Asp Ser Val Gly Glu Pro Leu Tyr Met Leu Phe Arg 

1700 1705 1710 

Gly He Lys His Gin Val Asp Lys Gly Pro Val Asp Ser Val Thr Gly 

1715 1720 1725 

Lys Ala Lys Tyr Thr Leu Asn Asp Asn Arg Leu Leu Arg Glu Asp Val 

1730 ^ 1735 1740 

Glu Tyr Arg Pro Leu Thr Leu Asn Ala Leu Leu Ala Val Gly Pro Gly 
1745 "* 1750 1755 1760 

Ala -Glv -Glu Ala -Gin -Gly -Val Pro Val Lys Val Leu Asp Cys Asp -Thr 

1765 1770 1775 

He Ser Gin Ala Lys Glu Lys Met Leu Asp Gin Leu Tyr Lys Gly Val 

1780 1785 1790 

Pro Leu Thr Gin Arg Pro Asp Pro Arg Thr Leu Asp Val Glu Trp Arg 

1795 1800 1805 

Ser Gly Val Ala Gly His Leu He Leu Ser Asp Glu Asp Val Thr Ser 

1810 1815 1820 

Glu Val Gin Gly Leu Trp Arg Arg Leu Asn Thr Leu Gin His Tyr Lys 
1825 ~ 1830 1835 1840 

Val Pro Asp Gly Ala Thr Val Ala Leu Val Pro Cys Leu Thr Lys His 

1845 1850 1855 

Val Leu Arg Glu Asn Gin Asp Tyr Val Pro Gly Glu Arg Thr Pro Met 

I860 1865 1870 

Leu Glu Asp Val Asp Glu Gly Gly He Arg Pro Trp His Leu Val Lys 

1875 1880 1885 

Pro Ser Asp Glu Pro Glu Pro Pro Arg Pro Arg Arg Gly Ser Leu Arg 

1890 1895 1900 

Glv Glv Glu Arg Glu Arg Ala Lys Ala He Pro Glu He Tyr Leu Thr 
1905 1910 1915 1920 

Arg Leu Leu Ser Met Lys Gly Thr Leu Gin Lys Phe Val Asp Asp Leu 

1925 1930 1935 

Phe Gin Val He Leu Ser Thr Ser Arg Pro Val Pro Leu Ala Val Lys 

1940 1945 1950 

Tvr Phe Phe Asp Leu Leu Asp Glu Gin Ala Gin Gin His Gly He Ser 

y 1955 I960 1965 

Asp Gin Asp Thr He His He Trp Lys Thr Asn Ser Leu Pro Leu Arg 

1970 1975 1980 

Phe Trp He Asn He He Lys Asn Pro Gin Phe Val Phe Asp Val Gin 
1985 1990 1995 2000 

Thr Ser Asp Asn Met Asp Ala Val Leu Leu Val He Ala Gin Thr Phe 

2005 2010 2015 

Met Asp Ala Cys Thr Leu Ala Asp His Lys Leu Gly Arg Asp Ser Pro 
2020 2025 2030 
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lie Asn Lys Leu Leu Tyr Ala Arg Asp He Pro Arg Tyr Lys Arg Met • 

2035 2040 2045 

Val Glu Arg Tyr Tyr Ala Asp He Arg Gin Thr Val Pro Ala Ser Asp 

2050 2055 2060 

Gin Glu Met Asn Ser Val Leu Ala Glu Leu Ser Trp Asn Tyr Ser Gly 
2065 2070 2075 2080 

Asp Leu Gly Ala Arg Val Ala Leu His Glu Leu Tyr Lys. Tyr He Asn 

2085 2090 2095 

Lys Tyr Tyr Asp Gin lie* lie Thr Ala Leu Glu Glu Asp Gly Thr Ala 

2100 2105 2110 

Gin Lys Met Gin Leu Gly Tyr Arg Leu Gin Gin lie Ala Ala Ala Val 

2115 2120 " 2125 

Glu Asn Lys Val Thr Asp Leu 
2130 2135 

<210> 11 

<211> 2190 

<212> DMA 

<213> HOMO SAPIEN 

<400> 11 

atgcctgctc tgggcccagc Jtat.tc.tccag .gctc.tctggg _cc.gggtgg.g_t cc.tcac.cctc . 6_0 

cagccccttc caccaactgc attcactccc aatggcacgt atctgcagca cctggcaagg 120 

gaccccacct caggcaccct ctacctgggg gctaccaact tcctgttcca gctgagccct 180 

gorge tgcagc tggaggecac agtgtccacc ggccctgtgc tagacagcag ggactgcctg 240 

ccacctgtga tgcctgatga gtgcccccag gcccagccta ccaacaaccc gaatcagctg 300 

ctcctggtga gcccaggggc cctggtggta tgcgggagcg tgcaccaggg ggtctgtgaa 3 60 

cagcggcgcc tggggcagct cgagcagctg ctgctgcggc cagagcggcc tggggacaca 420 

caatatgtgg ctgccaatga tcctgcggtc agcacggtgg ggctggtagc ccagggcttg 480 

gcaggggagc ccctcctgtt tgtggggcga ggatacacca gcaggggtgt ggggggtggc 540 

attccaccca tcacaacccg ggccctgtgg ccgcccgacc cccaagctgc cttctcctat 600 

gaggagacag ccaagctggc agtgggccgc ctctccgagt acagccacca cttcgtgagt 660 

gcctttgcac gtggggccag cgcctacttc ctgttcctgc ggegggaect gcaggctcag 720 

tctagagctt ttcgtgccta tgtatctcga gtgtgtctcc gggaccagca ctactactcc 780 

tatgtggagt tgcctctggc ctgcgaaggt ggccgctacg ggctgatcca ggctgeaget 840 

gtggccacgt ccagggaggt ggcgcatggg gaggtgetet ttgcagcttt ctcctcggct 900 

gcacccccca ctgtgggccg gcccccatcg gcggctgctg gggcatctgg agcctctgcc 960 

ctctgtgcct tccccctgga tgaggtggac eggcttgeta atcgcacgcg agatgectge 1020 

tacacccggg agggtcgtgc tgaggatggg accgaggtgg cctacatcga gtatgatgtc 1080 

aattctgact gtgeacaget gccagtggac accctggatg cttatccctg tggctcagac 1140 

cacacgccca gccccatggc cagccgggtc ccgctggaag ccacaccaat tctggagtgg 1200 

ccagggattc agctaacagc tgtggcagtc accatggaag atggacacac catcgctttc 1260 

ctgggtgata gtcaagggca getgeacagg gtctacttgg geccagggag cgatggccac 1320 

ccatactcca cacagagcat ccagcagggg tctgcagtga gcagagacct cacctttgat 1380 

gggacctttg agcacctgta tgtcatgacc cagagcacac ttctgaaggt tcctgtggct 1440 

tcctgtgctc agcacctgga ctgtgcatct tgccttgctc acagggaccc atactgtggg 1500 

tggtgcgtgc tccttggcag gtgeagtege cgttctgagt gctcgagggg ccagggccca 1560 

gagcagtggc tatggagctt ccagcctgag ctgggctgtc tgcaagtggc agecatgagt 1620 

cctgccaaca teagecgaga ggagacgagg gaggttttcc tatcagtgee agacctgcca 1680 

cccctgtggc caggggagtc atattcctgc cactttgggg aacatcagag tcctgccctg 1740 

ctgactggtt ctggtgtgat gtgcccctcc ccagacccta gtgaggcccc agtgctgccg 1800 

agaggagecg actaegtate cgtgagcgtg gagctcagat ttggcgctgt tgtgatcgcc 1860 

aaaacttccc tctctttcta tgactgtgtg gcggtcactg aactccgccc atetgegcag 192 0 

tgccaggcct gtgtgagcag ccgctggggg tgtaactggt gtgtctggca gcacctgtgc 1980 

acccacaagg cctcgtgtga tgctgggccc atggttgcaa gecatcaggt gatggagact 2040 
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cagcagagct tgagggccct cccgcccccc tcatcctccc gtccagcctc gactaccagt 2100 
atgacacccc cgggctctgg gagctggaag aggcgacctt gggggcaagc tcctgcccct 2160 
gtgtggagag cgttcagggc tccacgttga 2190 



<210> 12 

<211> 729 

<212> PRT 

<213> HOMO SAPIEN 

<400> 12 
Met Pro A 
1 



35 

Leu Gly Ale 
50 

Glu Ala Thj 
65 



115 

Gin Leu Leu 
130 

Ala Asn Asp 
145 



195 

Gly Arg Leu 

210 
Gly Ala Ser 
225 



275 

His Gly Glu 
290 

Val Gly Arg 
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240 
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Leu 
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Glu 
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Gly 
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355 360 365 

Val Asp Thr Leu Asp Ala Tyr Pro Cys Gly Ser Asp His Thr Pro Ser 

370 375 380 

Pro Met Ala Ser Arg Val Pro Leu Glu Ala Thr Pro lie Leu Glu Trp 
385 390 395 400 

Pro Gly lie Gin Leu Thr Ala Val Ala Val Thr Met Glu Asp Gly His 

405 410 415 

Thr lie Ala Phe Leu Gly Asp Ser Gin Gly Gin Leu His Arg Val Tyr 

420 . 425 430 

Leu Gly Pro Gly Ser Asp Gly His Pro Tyr Ser Thr Gin Ser lie Gin 

435 440 445 

Gin Gly Ser Ala Val Ser Arg Asp Leu Thr Phe Asp Gly Thr Phe Glu 

450 455 460 

His Leu Tyr Val Met Thr Gin Ser Thr Leu Leu Lys Val Pro Val Ala 
465 470 475 480 

Ser Cys Ala Gin His Leu Asp Cys Ala Ser Cys Leu Ala His Arg Asp 

485 490 495 

Pro Tyr Cys Gly Trp Cys Val Leu Leu Gly Arg Cys Ser Arg Arg Ser 

500 505 510 

Glu Cys Ser Arg Gly Gin Gly Pro Glu Gin Trp Leu Trp Ser Phe Gin 

515 520 525 

Pro Glu -Leu Gly Cys Leu Gin -Val Ala Ala -Met Ser -Pro Ala Asn -lie 

530 535 540 

Ser Arg Glu Glu Thr Arg Glu Val Phe Leu Ser Val Pro Asp Leu Pro 
545 550 555 560 

Pro Leu Trp Pro Gly Glu Ser Tyr Ser Cys His Phe Gly Glu His Gin 

565 570 575 

Ser Pro Ala Leu Leu Thr Gly Ser Gly Val Met Cys Pro Ser Pro Asp 

580 585 590 

Pro Ser Glu Ala Pro Val Leu Pro Arg Gly Ala Asp Tyr Val Ser Val 

595 600 605 

Ser Val Glu Leu Arg Phe Gly Ala Val Val lie Ala Lys Thr Ser Leu 

610 615 620 

Ser Phe Tyr Asp Cys Val Ala Val Thr Glu Leu Arg Pro Ser Ala Gin 
625 630 635 640 

Cys Gin Ala Cys Val Ser Ser Arg Trp Gly Cys Asn Trp Cys Val Trp 

645 650 . 655 

Gin His Leu Cys Thr His Lys Ala Ser Cys Asp Ala Gly Pro Met Val 

660 665 670 

Ala Ser His Gin Val Met Glu Thr Gin Gin Ser Leu Arg Ala Leu Pro 

675 680 685 

Pro Pro Ser Ser Ser Arg Pro Ala Ser Thr Thr Ser Met Thr Pro Pro 

690 695 700 

Gly Ser Gly Ser Trp Lys Arg Arg Pro Trp Gly Gin Ala Pro Ala Pro 
705 710 715 720 

Val Trp Arg Ala Phe Arg Ala Pro Arg 

725 . '^h ■ / ; .< V - 
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